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Preface to the First Edqition 


They throw geometry out the door and it comes back through the win- 
dow. 
(H.G.Forder Auckland 1973, reading new mathematics at the age of 84) 


The subject of this book is numerical methods that preserve geometric properties of 
the fow of a differential equation: Symplectic integrators for Hamiltonian Systems， 
Symmetric integrators for reversible Systems, methods preserving first integrals and 
numerical methods on manifolds, including Lie group methods and integrators for 
constrained mechanical Systems, and methods for problems with highly oscillatory 
Solutions. Structure preservation - with its questions as to where, how, and what for 
-js the unifying theme. 

In the last few decades, the theory of numerical methods for general (non-stiff 
and stifft) ordinary differential equations has reached a certain maturity, and excel- 
lent general-purpose codes, mainly based on Runge-Kutta methods or linear mul- 
tistep methods, have become available. The motivation for developing Structure- 
preserving algorithms for Special classes of problems came Independently from such 
different areas of research as astronomy, molecular dynamics, mechanics, theoreti- 
cal physics, and numerical analysis as well as from other areas of both applied and 
pure mathematics. It turned out that the preservation of geometric properties of the 
fow not only produces an Improved qualitative behaviour, but also allows for a more 
accurate long-time integration than with general-purpose methods. 

An important Shift of view-point came about by ceasing to concentrate on the 
numerical approximation of a Single solution trajectory and instead to consider a 
numerical method as a discrele dyma11aical syste1 Which approximates the fow of 
the differential equation - and so the geometry of phase Space comes back again 
through the window. This view allows a clear understanding of the preservation of 
invariants and of methods on manifolds, of Symmetry and reversibility of methods， 
and of the Symplecticity of methods and various generalizations. These Subjects are 
presented in Chapters IV through VII of this book. Chapters Ithrough II are of an 
introductory nature and present examples and numerical integrators together with 
important parts of the classical order theories and their recent extensions. Chapter 
VJII deals with questions of numerical Implementations and numerical merits of the 
various Imethods. 

It remains to explain the relationship between geometric properties of the nu- 
merical method and the favourable error propagation in long-time integrations. This 
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is done using the idea of packwal1d error analysISs，wWhere the numerical one-step 
map is interpreted as (almost) the fow of a modified differential equation，which is 
constructed as an asymptotic series (Chapter IX). In this way, geometric properties 
of the numerical integrator translate into Structure preservation on the level of the 
modified equations. Much inslight and Trigorous error estimates over long time in- 
tervals can then be obtained by combining this backward error analysis with 天 AM 
theory and related perturbation theories. This ls explained in Chapters X through 
XII for Hamiltonian and reversible Systems. The final Chapters XIII and XIV treat 
the numerical solution of differential equations with high-frequency oscillations and 
the long-titme dynamics of multistep methods, respectively. 

This book grew out of the lecture notes of a course given by Ernst Hairer at 
the University of Geneva during the academic year 1998/99. These lectures were 
directed at Students in the third and fourth year. The reactions of Students as well 
as of many colleagues，who obtained the notes from the Web, encouraged us to 
elaborate our ideas to produce the present monograph. 

We want to thank all those who have helped and encouraged us to prepare this 
book. In particular, Martin Hairer for his valuable help in installing computers and 
his expertise in Latex and Postscript，Jeff Cash and Robert Chan for reading the 
whole text and correcting countless Scientifc obscurities and linguistic errors, Haruo 
Yoshida for making many valuable Suggestions, StEphane Cirilli for preparing the 
files for all the photographs, and Bernard Dudez, the irreplaceable director of the 
mathematics library in Geneva. We are also grateful to many friends and colleagues 
for reading parts of the manuscript and for valuable remarks and discussions，in 
particular to Assyr Abdulle, Melanie Beck, Sergio Blanes, John Butcher, Mari Paz 
Calvo, Begofia Cano, Philippe Chartier David Cohen, Peter Deuflhard, Stig Faltin- 
Sen，Francesco Fasso, Martin Gander, Marlis Hochbruck, Bulent Karasozen，VWil- 
helm 天 aup，Ben Leimkuhler, Pierre Leone，Frank Loose, Katina Lorenz，Robert 
MecLachlan，Ander Murua，Alexander Ostermann，Truong Linh Pham，sSebastian 
Reich, Chus Sanz-Serna, Zaijiu 9hang, Yifa Tang, Matt West, Will Wiight. 

We are especially grateful to Thanh-Ha Le Thi and Dr Martin Peters from 
Springer-Verlag Heidelberg for assistance, in particular for their help in getting most 
of the original photographs from the Oberwolfach Archive and from Springer New 
York, and for clarifying doubts concerning the copyright. 


Geneva and Tubingen, November 2001 The Authors 


Preface to the Second Edqition 


The fast development of the subject - and the fast development of the sales of the 
first edition of this book - has given the authors the opportunity to prepare this Sec- 
ond edition. First of all we have corrected several misprints and minor errors which 
we have discovered or which have been kindly communicated to us by several read- 
ers and colleagues. We cordially thank all of them for their help and for their interest 
in oOUr work. A major point of confusion has been revealed by Robert McLachlan in 
his book review in SIAM Reviews. 


Besides many details，which have improved the presentation throughout the 


book, there are the following major additions and changes which make the book 
about 130 pages longer: 


a more prominent place of the Stormer-Verlet method in the exposition and the 
exXamples of the first chapter; 

adiscussion of the HEnon-Heiles model as an example of a chaotic Hamiltonian 
System 

a new Sect.IV.9 on geometric numerical linear algebra considering differential 
equations on Stiefel and Grassmann manifolds and dynamical low-rank approxi- 
Imations， 

anew improved composition method of order 10 in Sect. V.3; 

a characterization of B-series methods that conserve quadratic first integrals and 
a criterion for conjugate Symplecticity in Sect. VI.8; 

the Section on Volume preservation taken from Chap. VJI to Chap. VT; 

an extended and more coherent Chap. VIL renamed Non-Canonical Hamiltonian 
Systems, with more emphasis on the relationships between Hamiltonian Systemas 
on manifolds and Poisson Systems; 

a completely reorganized and augmented Sect. VII.3 on the riglid body dynamics 
and Lie-Poisson Systems; 

anew ect.VII.6 on reduced Hamiltonian models of quantum dynamics and Pois- 
Son integrators for their numerical treatment; 

an improved step-Size control for reversible methods in Sects.VII.3.2 and IX.0; 
extension of Sect.IX.3 on modified equations of methods on manifolds to include 
constrained Hamiltonian Systems and Lie-Poisson integrators; 

reorganization of Sects.IX.9 and IX.10; study of non-Symplectic B-series meth- 
ods that haveamodified Hamiltonian, and counter-examples for symmetric meth- 
ods Showing linear growth in the energy error; 
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- amore precise discusSion of integrable reversible Systems with new examples in 
Chap. XI; 

- extenslion of Chap.XII on highly oscillatory problems to Systems with Several 
Constant fredquencies and to Systems with non-constant mass InatriX; 

- a new Chap.XIV on oscillatory Hamiltonian Systems with time- Or Solution- 
dependent high frequencies, emphasizing adiabatic transformations, adiabatic in- 
variants, and adiabatic integrators; 

- a completely rewritten Chap.XV with more emphasis on linear multistep meth- 
ods for second order differential equations; a complete backward error analysis 
including parasitic modified differential equations; a Study of the long-time sta- 
bility and arigorous explanation ofthe long-time near-conservation of energy and 
angular momentum， 


Let us hope that this second revised edition will again meet good acceptance by OUT 
Teaders. 


Geneva and Tubingen, October 2005 The Authors 
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Chapter 工 . 
Examples and Numerical 上 xperiments 


This chapter introduces Some interesting examples of differential equations and i- 
lustrates different types of quajitative behaviour of numerical methods. We deliber- 
ately consider only very Simple numerical methods of orders 1 and 2 to emphasize 
the quajitative aspects of the experiments. The Same effects (on a different Scale) 
OoccuUT with more Sophisticated higher-order integration Schemes. The experiments 
presented here Should serve as a motivation for the theoretical and practical inves- 
tigations of later chapters. The reader is encouraged to repeat the experiments or to 
invent Similar ones. 


II.1 First Problems and Methods 


Numerical applications of the case of two dependent variables are not 
easily obtained. (A.J. Lotka 1925, p.79) 


Our first problems, the Lotka-Volterra model and the pendulum equation, are dif- 
ferential equations in two dimensions and show already many interesting geometric 
properties. Our first methods are various variants of the Euler method, the midpoint 
rule, and the Stormer-Verlet Scheme. 


I.1.1 The Lotka-yvolterra Model 


We start with an equation from mathematical biology which models the growth of 
animal species. If areal variable vw(t is to represent the number of individuals of a 
certain Species at timet, the simplestassumption about its evolution is du/dt = 凡 aQ， 
Where Q is the reproduction rate. A constant a leads to exponential growth. In the 
case of more Species living together the reproduction rates will also depend on 
the population numbers of the ofper Species. For example, for two Species with 
u(t denoting the number of predators and v( 芭 the number of prey, a plausible 
asSumption is made by the ZLotka-yWlierra 1ode/ 


&(u 一 2) 
u 人 (1 一切 )， 


以 


(1.D 


where the dots on and vw stand for differentiation with respect to time. (We have 
chosen the constants 2 and 1 in (1.1) arbitrarily.) A.J. Lotka (1925, Chap. VIU) used 
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Fig. 1.1. Vector field, exact fow, and numerical fow for the Lotka-Volterra model (1.1) 


this model to study parasiltic invaslion of insect Species, and, with its help, V. Volterra 
(1927) explained curious fishing data from the upper Adriatic Sea following World 
War 工 

Equations (1.1) constitute an autonomous System of differential equations. In 
general, we write Such a System in the form 


= jy) . (1.2) 


Every Vy represents a point in the pjpase space, in equation (1.1) above y = (u) 
is in the phase plane 月 2. The vector-valued function (yy) represents a vector jie1d 
which, at any point of the phase Space, prescribes the velocity (direction and Speed) 
of the solution y(t) that passes through that point (see the first picture of Fig. 1.1)， 
For the Lotka-Volterra model, we observe that the System cycles through three 
stages: (]) the prey population increases; (2) the predator population increases by 
feeding on the prey; (3) the predator population diminishes due to lack of food. 


Flow of the System. A fundamental concept is the Jow over time 纪 This is the 
mapping which, to any point yo in the phase space, associates the value y( 芭 of the 
solution with initial value V(0) = yo. This map, denoted by pr, is thus defined by 


ot(yo) =2 坟 让 0) =2%o0. (1.3) 


The second picture of Fig. 1.1 Shows the results of three iterations of pf (with 上 一 
1.3) for the Lotka-Volterra problem, for a set of initial values yo = (wo,vo) forming 
an animal-shaped set 4.1 


JInvariants. If we divide the two equations of (1.1) by each other, we obtain a single 
equation between the variables vv and v. After Separation of variables we get 


工 一 亿 4 一 2 dg 
一 / ) 一 了 
0 国人 六 (2 U) 


1 This cat came to fame through Arnold (1963). 
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Where 
T(wU) = 一 nu 一 十 21nv 一， (1.4) 


So that 7(u( 坟 ,v 坟 ) = Comst for all 二 We call the function 7 an ivariant of the 
System (1.1). Every solution of (1.1) thus lies on a level curve of (1.4). Some of 
these curves are drawn in the pictures of Fig.1.1. Since the level curves are closed， 
all solutions of (1.1) are periodic. 


I.1.2 First Numerical Methods 


上 xplicit Euler Method. The simplest of all numerical methods for the System (1.2) 
is the method formulated by Euler (17068)， 


yn+l 三 yn 十 九 帮 yn). (1.3) 


It uses a constant Step Size 丸 to compute, one after the other approximations V1, V2， 
V3a,，.. .to the values y(j), V(27), V(3A)，... of the solution starting from a given 
initial value V(0) = yo. The method is called the explicir Euler ae 如 od, because 
the approximation yn+l is computed using an explicit evaluation of j 太 at the already 
known value yn. Such a formula represents a mapping 


9 :UVm FF > Zn 十 1， 


Which we call the discrete or 71U1aierical jiow. Some iterations of the discrete How for 
the Lotka-Volterra problem (1.1) (with 下 三 0.5) are represented in the third Picture 
of Fig. 1.1. 


JImplicit Euler Method. The zzPlicit Exier etpod 


yn+l 三 Vn 十 尺 帮 yn+T1)， (1.0) 


is known for its all-damping Stability properties. In contrast to (1.5), the approx- 
imation yn+1 is deftined implicitly by (1.6), and the implementation requires the 
numerical solution of a nonlinear System of equations. 


JImplicit Midpoint Rule. Taking the mean of y, and yn+linthe argument of 矿 we 
get the 77DLicit 121CDoIOL ULe 


2m 十 Zn 十 1 ) 


(1.7) 


Vn+1 三 ynr 十 pf 
It is a yy1z111etjic method, wphich means that the formula is left unaltered after ex- 
changing yn 全 yn+l and 几 人 一 六 (more on Symmetric methods in Chap. V). 


Symplectic Euler Methods. For Pay7titio7zed Systemas 


勾 一 al(u) 48) 


= Www， 
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SA NA 








explicit Euler 








Fig. 1.2. Solutions of the Lotka-Volterra equations (1.1) (step Sizes 几 三 0.12; initial values 
(2,2) for the explicit Euler method, (4, 8) for the implicit Euler method, (4, 2) and (6, 2) for 
the Symplectic Euler method) 


Such as the problem (1.1), we consider also Partitiomed Euler methods 


nl 三 Un 十 /an Un nl 三 Un 十 尹 a(Un 1 Un 
1 人 +1) 二 1 (un+1) Un) 41.9) 


Un+l 二 Un 十 PLD(2mn， Un 二 1)， Vn+Ll 二 Un 下 PLD(2n 1 Un )， 














which treat one variable by the implicit and the other variable by the explicit Euler 
method. In view of an Important property of this method, discovered by de Vogelaere 
(1936) and to be discussed in Chap.VIL, we call them sy1zPlectic 已 ULer 1Ietjpody. 


Numerical xample for the Lotka-Volterra Problem.， Our first numerical experT- 
iment Shows the behaviour of the various numerical methods applied to the Lotka 一 
Volterra problem. In particular, we are interested in the preservation of the invariant 
7 over long times. Fig. 1.2 plots the numerical approximations of the first 123 steps 
with the above numerical methods applied to (1.1), all with constant Step SizeSs. We 
observe that the explicit and implicit Euler methods show wrong quajlitative be- 
haviour. The numerical solution either Spirals outwards or inwards. The symplectic 
Euler method (implicit in vv and explicit in v), however, gives a numerical solution 
that lies apparently on a closed curve as does the exact Solution. Note that the curves 
of the numerical and exact Solutions do not coincide. 


I.1.3 The Pendulum as a Hamiltonian System 


A great deal of attention in this book will be addressed to Hamiltonian propblems， 
and ouUr next examples will be of this type. These problems are of the form 


六 = 一 瑟 v(D, 9)， = 万 (P,9)， (1.10) 


Where the 瑟 azazltomia1n 殉 (Di,... ,Du;9ql, .dd) Tepresents the total energy; 9i are 
the position coordinates and Pi the momenta for z = 1,...,q, with wd the number of 
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degrees of freedom; 瑟 p and 克 are the vectors of partial derivatives. One verifies 
easily by differentiation (See Sect.IV.l) that, along the solution curves of (1.10)， 


瓦 (pg() = Const (1.11) 


lj.e., the Hamiltonian ls an invariant or a jjst 171eg1a/. More details about Hamil- 
tonian Systems and their derivation from Lagrangian mechanics will be given in 
Sect. VI.1. 


Pendulum. The mathematical pendulum (mass 7 三 1， 





massless rod of length 上 = 1，gravitational acceleration 

0 =J)isasystem with one degree of freedom having the 人 

Hamiltonian 以 
刀 六 人 = 3 天 一 cosy (GI2) ceosg NI 


So that the equations of motion (1.10) become 
厂 王 一 Sin dg， 0 王 7. (1.13) 


Since the vector field (1.13) 18 27r-periodic in q, it is natural to consider dg as a vari- 
able on the circle 91. Hence, the phase space of points (p, g) becomes the cylinder 
及 x 951. Fig. 1.3 shows some level curves of 万 (p, q). By (1.11), the solution curves 
of the problem (1.13) lie on Such level curves. 














exact flow explicit Euler Symplectic Euler 


Fig. 1.3. Exact and numerical fow for the pendulum problem (1.13); step Sizes 琅 三 上 一 工 


Area Preservation. Figure 1.3 (first picture) illustrates that the exact fow of a 
Hamiltonian System (1.10) ls area preserving. This can be explained as follows: the 
derivative of the How wp with respect to initial values (D, 9)， 


2f(D; 9) 一 和 
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satisfies the variational equation ? 


./ 一 所 一 万 
spo=( 四 ”) wpg)， 
已 pp Lp 


Where the second partial derivatives of 妃 are evaluated at wx(Dp,9q). In the case of 
one degree of freedom (Cd = 1), a Simple computation Shows that 


q / dropt odd) op 6d() 
op,9) 一 过 on 0 od pn 】 





至 呈 二 和 寺 : 


Since wo is the identity, this implies det of(p,d) = 1 for all 上 which means that the 
fow pz(p, aq) is an area-preserving mapping. 

The last two pictures of Fig.1.3 Show numerical fows. The explicit Euler 
Imethod is clearly seen not to preserve area but the Symplectic Euler method is (this 
will be proved in Sect.VI.3). One of the aims of geometric integration ls the Study 
of numerical integrators that preserve Such types of quajitative behaviour of the ex- 
act fow. 




















explicit Euler Symplectic Euler Stormer-Verlet 


Fig. 1.4. Solutions of the pendulum problem (1.13); explicit Euler with step Size 尺 一 0.2， 
initial value (po, go) = (0,0.5); symplectic Euler with 玉 = 0.3 and initial values do = 0， 
Do 三 0.7,1.4,2.1; Stormer-Verlet with 下 一 0.6 


Numerical 了 xperiment， We apply the above numerical methods to the pendulum 
equations (See Fig. 1.4). Similar to the computations for the Lotka-Volterra equa- 
tions，we observe that the numerical Solutions of the explicit Euler and of the im- 
plicit Euler method (not drawn in Fig.1.4) Spiral either outwards or inwards. The 
Symplectic Euler method shows the correct qualitative behaviour, but destroys the 
left-right symmetry of the problem. The Stormer-Verlet Scheme, which we discusSs 
next,， works perfectly even with doubled step Size. 


2 As is common in the study of mechanical problems，we use dors for denoting time- 
derivatives, and we use Prinies for denoting derivatives with respect to other variables. 
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Po Jacques Verlet-Banide 





Fig. 1.$. Carl Stormer (left picture), born: 3 September 1874 in Skien (Norway), died: 13 Au- 
gust 1957. 
Loup Verlet (right picture)j, born: 24 May 1931 in Paris 


I.1.4 The Stormer-Verlet Scheme 
The above equations (1.13) for the pendulum are of the form 


= jd) 
4 三 2 


or = jg (1.14) 


which is the Important Special case of a second order differential equation. The most 
natural discretization of (1.14) 1s 


do 一 2 十 on 1 =72Fo)， (1.15) 


which is just obtained by replacing the Second derivative in (1.14) by the central 
Second-order difference quotient. This basic method, or its equivalent formulation 
given below,is called the SOmmaer7aethodin astronomy, the Verlefmmetpod3 in mole- 
cular dynamics, the /eap-jrosg etpod in the context of partial differential equations， 
andithas furthernames in other areas (See Hairer Lubich & Wanner (2003), p. 402). 
C. Stormer (1907) used higher-order variants for numerical computations concern- 
ing the aurora borealis. 二 . Verlet (1967) proposed this method for computations in 
Imolecular dynamics，where it has become by far the most widely used integration 
Scheme. 

Geometrically, the Stormer-Verlet method can be seen as produced by parabo- 
las，which in the points 刀 possess the right second derivative Fw ) (See Fig.1.6 


3 Irony of fate: Professor Loup Verlet who later became interested in the history of science， 
discovered precisely “his”method in Newton's Principia (Book LI figure for Theorem ， 
See Sect.I.2.1 below). 
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区 过 工 tn tn 十 1 











Fig. 1.6. Ilustration for the Stormer-Verlet method 


to the leftb). But we can also think of polygons, which possess the right Slope in the 
midpoints (Fig. 1.0 to the right)， 
Approximations to the derivative D = d are Simply obtained by 


人 and Dll73 到 人 dm (1.16) 
One-Step Formulation. The Stormer-Verlet method admits a one-step formulation 
which is useful for actual computations. The value dan together with the Slope pn and 
the second derivative Fan), all at 如 ,uniquely determine the parabola and hence 
also the approximation (pn+l,qn+i) at t 如 +1. Writing (1.13) as pn+1/2 一 Ph 172 一 
几 Lan) and using pn+l/2 十 pn _1/2 一 2pn ,we getby elimination of either pn+172 
or pn -1/2 the formulae 


Dam 一 





用 
PPm 二 1/2 二 Dm 十 了 jgn) 





dmn+1 二 dmn 下 RDn+I172 (1.17) 





几 
Dmp++Ll 一 PPp 二 172 让 了 jan+1) 


Which is an explicit one-step method 2 : (an,pn) 上 (ai,Dn+i) for the corre- 
Sponding first order System of (1.14). 玉 one is not interested in the values pn of the 
derivative, the first and third equations in (1.17) can be replaced by 


DPn+l1/2 三 pn_1/2 十 九 帮 am) 


TI2 The 天 epler Problem and the Outer Solar System 


Iawoke as if from sleep, anew light broke on me. (J. Kepler; quoted 
from J.L.E.Dreyer 4 1istory o astrono1y，1906, Dover 1953, p.391) 


One of the great achievements in the history of Science Was the discovery of the 
lawsgs of J.Kepler (1609), based on many precise measurements of the positions of 
Mars by TIycho Brahe and himself. The planets move im elizptic orpixs with the Sun 
at one of the foci (Keplers first law) 
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Q 
允 生 一 Q 一 Qecos 耳 ， (2.1) 
工 十 ecos 
(where Qw = great axXis，e 三 eccentricity，D 一 


avV1--e2d =bvV1I-e2 = al 一 e2), 瑟 一 ec- 
centric anomaly, 2 = true anomaly). 

Newton (Principia 1687) then expiaized this 
motion by his general law of gravitational attrac- 
tion (proportional to 1/r2) and the relation between 
forces and acceleration (the “Lex II” of the PrzP- 
cipia). This then opened the way for treating arbi- 
trary celestial motions by solving differential equa- 
tions， 











Two-Body Problem. For computing the motion of two bodies which attract each 
other we choose one ofthe bodies as the centre of our coordinate System; the motion 
will then Stay in a plane (Exercise 3) and we can use two-dimensional coordinates 
d = (qi,q2) for the position of the second body. Newton'”s laws,，with a sujitable 
normalization, then yield the following differential equations 


Re d1 症 d2 
人 大 人 


This is equivalent to a Hamiltonian System with the Hamiltonian 


1 工 . 
瑟 (pi,pa,gql,q) 一 三 (21 十 D2) 一 5 Pi 三 水. (2.3) 


2 QZ 十 0 


I2.1 Angular Momentum and 天 epler”s Second Law 


The system has not only the total energy 万 (pp,q) as a first integral, but also the 
angular momentum 

也 (pl1, pa,91,92) = 91D2 一 92D1. (2.4) 
This can be checked by differentiation and is nothing other than Keplers seco1d 
Law,， which says that the ray EAI Sweeps equal areas in equal times (See the jlittle 
Picture at the beginning of Sect. II.2). 

A beautiful geommetric justification of this law is due to INewton (Principia 
(1687)，Book [L fgure for Theorem D. The idea is to apply the Stormer-Verlet 
Scheme (1.15) to the equations (2.2) (See Fig.2.1). By hypothesis，the diago- 
nal of the parallelogram dn -ldqndn+l1，which is (ai 一 an) 一 (ao 一 an 1) = 
dan+1 一 2 十 dan li = Const .jd points towards the sun 9. Therefore, the 
altitudes of the triangles on -1qn9 and dl1dqn9 are equal. Since they have the com- 
mon base qn9, they also have equal areas. Hence 


det(qn 1， dm 一 dan_1) det(qn， dm 十 1 一 dan) 
and by passing to the limit 妃 一 0 we see that det(d,D) = Comst. This is (2.4)， 


4 We are grateful to a private communication of 直 . Verlet for this reference 
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Fig. 2.1. Proof of Keplers Second Law (lefb; facsimile from Newton's Principia (rightb) 


We have not only an elegant proof for this invariant,，but we also see that 1je 
Si1O1111e1 一 YeFlet sc1pe11e DJese1Vvey 1Pisy inyaria1t Jor every 天 > 0. 


II.2.2 了 Exact Integration of the 天 epler Problem 


Pour voir prEsentement que cette courbe 4BPC ...esttobnjours une Sec- 
tion Conique, ainsi que Mr Newton Ta Suppose, pa8. 95. Co1o11.7 Sans le 
d6montrer; il y faut bien plus dadresse: (Joh. Bernoulli 1710, p.475) 


Itis now interesting, inversely to the procedure of Newton, to prove that a1zy Solution 
of (2.2) follows either an elliptic, parabolic or hyperbolic arc and to describe the 
Solutions analytically. This was first done by Joh. Bernoulli (1710, full of sarcasm 
against Newton), and by Newton (1713, second edition of the Pincipia，without 
mentioning a word about Bernoulli). 

By (2.3) and (2.4), every solution of (2.2) Satisfies the two relations 


| 
.2 : 2 
如 ) - -一 一 也， d192 一 0q291 三 了 0， (2.3) 
VGTE 十 邓 


Where the constants 万 0o and ZLo are determined by the initial values. USing polar 
coordinates qd1 一 7 cos %, q2 一 7 Sin ,this System becomes 





1 1 
人 7r20 一 了 工 0. (2.60) 
了 
For its solution we consider r as afunction of po and write 7 一 外 : 2. The elimina- 
tion of 2 in (2.0) then yields 
1 dr\ 2 人 八 1 
= | | 一 一 一 一 一 卫 0. 
2 (( 呈 ) 】 和 " 
In this equation we use the substitution r = 1/u, dr = 一 qu/uw2 ,which gives (with 
一 dj/d0) 
| /12 了 人 万 0 
| 一 0， 229 
(十 ) 项 过 肖 (2.7) 


This is a "Hamiltonian ”for the System 


[2 The Kepler Problem and the Outer Solar System 11 





1 1 1 (OOD 一 0 
岂 十 人 一 二 ie.，V 一 二 十 clcoso 十 cosinw 一 人 (2.8) 
Q Q Q 
where Qw 一 了 8 and the constant e becomes, from (2.7)， 
e2 一 1 十 2 (2.9) 


(by Exercise 7, the expression 1 十 2 瑟 0o7i is non-negative). This is precisely formula 
(2.1). The angle ”jls determined by the initial values ro and wo. Equation (2.1) 
Tepresents an elliptic orbit with eccentricity e for 五 0 < 0 (See Fig.2.2, dotted line)， 
aparabola for 万 o = 0, and a hyperbola for 互 0 > 0. 

Finally，we must determine the variables 7 and % as functions of 上 With the 
relation (2.8) and7 = 1/uw, the second equation of (2.6) gives 


C2 





dp 一 7 地 (2.10) 


(1 +ecos(P 一 3)) 


which, after an elementary, but not easy, integration, Tepresents an implicit equation 
for 2 信 ， 

















400 000 steps 
尺 王 0.0005 
上 | 上 上 上 | 上 
一 2 1 
4000 steps 
Symplectic Euler 六 一 0.05 
implicit midpoint 4000 steps 四 4000 steps 
九 王 0.05 尺 王 0.05 








Stormer-Verlet 

















Fig. 2.2. Numerical solutions of the Kepler problem (eccentricity e 一 0.6; in dots: exact 
Solution) 
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I2.3 Numerical Integration of the 玉 epler Problem 


For the problem (2.2) we choose, with0 和 e < 1, the initial values 


工 
na0=1-e oO=0 90=0 290=V/ 2 CD 





This implies that 有 Fo = -1/2, 10 =V1I--e2dg=1-e2ando* =0.Theperiod 
of the solution is 27 (Exercise 9). Fig.2.2 Shows Some numerical solutions for the 
eccentricity e 三 0.6 compared to the exact Solution. After our previous experience， 
it is no Jonger a Surprise that the explicit Euler method Spirals outwards and gives a 
completely wrong answer. For the other methods we take a Step Size 100 times larger 
in order to “see Something”. We see that the nonsymmetric Symplectic Euler method 
distorts the ellipse, and that all methods exhibit a Precessio1 effect, clockwise for 
Stormer-VYerlet and Symplectic Euler anti-clockwise for the implicit midpoint rule. 
The Same behaviour occurs for the exact Solution of Perturped Kepler problems 
(Exercise 12) and has occupied astronomers for centuries. 

OUT next experiment (Fig.2.3) studies the conservation of invariants and the 
global error. The main observation ls that the error in the energy grows linearly for 
the explicit Euler method, and it remains bounded and Small (no secular terms) for 
the Symplectic Euler method. The global error measured in the Euclidean norm， 
Shows a quadratic growth for the explicit Euler compared to a linear growth for 
the Symplectic Euler As indicated in Table 2.1 the implicit midpoint rule and the 
Stormer-Verlet Scheme behave Similar to the Symplectic Euler, but have a Smaller 


上 conservation of energy 
.02 上 explicit Euler, 浆 三 0.0001 


.01 
提 Symplectic Euler 六 = 0.001 


100 











global error of the solution 








explicit Euler 浆 = 0.0001 


Symplectic Euler 六 = 0.001 








Fig. 2.3. Energy conservation and global error for the Kepler problem 
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Table 2.1. Qualitative long-time behaviour for the Kepler problems; t is time, 彤 the step Size 























method errorin 如 | errorin 亏 | global error 
explicit Euler O(t) O(t) O( 妇 门 
Symplectic Euler O(7) 0 O( 雪 ) 
implicit midpoint O(12) 0 O(tp2) 
Stormer-Verlet O(12) 0 O(tp2) 





error due to their higher order. We remark that the angular momentum 厂 (p, 9q) is ex- 
actly conserved by the Symplectic Euler the Stormer-VYerlet, and the implicit mid- 
point rule. 


I.2.4 The Outer Solar System 


The evolution of the entire planetary System has been numerically in- 

tegrated for a time span of nearly 100 million years5. This calculation 

confirms that the evolution of the Solar System as a whole is chaotic, . . . 
(G.J.Sussman 儿 J. Wisdom 1992) 


We next apply our methods to the System which describes the motion of the five 
outer planets relative to the Sun. This System has been studied extensively by as- 
tronomers. The problem is a Hamiltonian System (1.10) (W-body problem) with 


ao9- 了 寺 zn- cy 末 ee C.12) 


乞 乞 不 = 省 


Here p and dg are the Supervectors composed by the vectors pi, oi E 了 肥 3 (momenta 
and positions), respectively. The chosen units are: masses relative to the Sun, So that 
the Sun has mass 1. We have taken 


100 一 1.00000597682 


to take account of the inner planets. Distances are in astronomical units (1 [A.U.] 三 
149 597870 [kmj]), tmes in earth days, and the gravitational constant is 


G = 2.95912208286 . 10-4. 


The initial values for the sun are taken as qo(0) = (0,0,0) and do(0) = (0;0,0) 
All other data (masses of the planets and the initial positions and initial veloci- 
ties) are given in Table 2.2. The initial data is taken from “Ahnerts Kalender far 
Sternfreunde 1994”, Johann Ambrosius Barth Yerlag 1993, and they correspond to 
September 5, 1994 at 0h00.6 


5 100 million years is not much in astronomical time scales; it just goes back to“Jurassic 
Park ”. 
5 We thank Alexander Ostermann, who provided us with this data. 
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Table 2.2. Data for the outer Solar System 



































Planet Iass initial position | initial velocity 
一 3.5023653 0.00565429 

Jupiter 701 一 0.000954786104043 一 3.8169847 |」 一 0.00412490 
一 1.5507963 |」 一 0.00190589 

9.0755314 0.00168318 

Saturn 702 一 0.000285583733151 一 3.0458353 0.00483525 
一 1.6483708 0.00192462 

8.3101420 0.00354178 

Uranus 703 二 0.0000437273164546 | 一 16.2901086 0.00137102 
一 7.2521278 0.00055029 

11.4707666 0.00288930 

Neptune || 7n4 = 0.0000517759138449 | 一 25.7294829 0.00114527 
一 10.8169456 0.00039677 

一 15.5387357 0.00276725 

Pluto ma05 一 1/(1.3.103) 一 25.2225594 | 一 0.00170702 
一 3.1902382 | 一 0.00136504 





explicit Euler 疡 = 10 


implicit Euler 天 = 10 














Fig. 2.4. Solutions of the outer Solar System 





To this System we apply the explicit and implicit Euler methods with step Size 
彤 三 10, the Symplectic Euler and the Stormer-Verlet method with much larger 
Step Sizes 几 = 100 and 几 = 200, repectively, all over a time period of 200 000 
days. The numerical solution (See Fig. 2.4) behaves Similarly to that for the Kepler 
problem. With the explicit Euler method the planets have increasing energy, they 
Spiral outwards, Jupiter approaches Saturn which leaves the plane of the two-body 
motion. With the implicit Euler method the planets (first Jupiter and then Saturn) 
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fall into the sun and are thrown far away. Both the Symplectic Euler method and 
the Stormer-Verlet Scheme Show the correct behaviour. An integration over a much 
longer time of Say several million years does not deteriorate this behaviour. Let us 
remark that Sussman 狠 Wisdom (1992) have integrated the outer Solar System with 
Special geometric integrators. 


I.3 The Heknon-Heiles Model 


... because: (]) it is analytically simple; this makes the computation of 
the trajectories easy; (2) at the same time, it is sufficiently complicated to 
give trajectories which are far from trivial . (HeEnon 儿 Heiles 1904) 


The Henon-Heiles model was created for describing stellar motion, followed for a 
very long time, inside the gravitational potential Co(r, >) of a galaxy with cylindrical 
Symmetry (HEnon 儿 Heiles 19604). Extensive numerical experimentations Should 
help to answer the question, if there exists, besldes the known invariants 瓦 and 也 ， 
a 1jprzrd invariant, Despite endless tentatives of analytical calculations during many 
decades, Such a formula had not been found. 

After areduction of the dimension, a Hamiltonian in two degrees of freedom of 
the form 


刀 (p,g) = (02 + 区) 十 U7(g) (3.1) 


is obtained and the question is, 让 Such an equation has a seco1d invariant，Here， 
Henon and Heiles put aside the astronomical origin of the problem and choose 


1 
7(g) = 了 (和 十 2) 十 是 g2 一 了 (3.2) 


已 | 哺 


(See citation). The potential L is represented in Fig.3.1. When LV approaches 下， the 
level curves of L tend to an equilateral triangle，whose vertices are Saddle points 
of LV . The corresponding System 























Fig. 3.1. Potential of the Heknon-Heiles Model and a solution 
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Fig. 3.2. PoincarE cuts for qil = 0,Dp1 > 0 ofthe Hknon-Heiles Model for 互 三 十 (6 orbits， 
lefb and 互 三 (] orbit, rightb) 





. 填 妇 Explicit Euler 2 Implicit Euler 
0 光 
0 
















Preoo 人 人 

上 ”inbold: 已 ， .Pioo in bold: Fooo,.….，Psa2g 
Fig. 3.3. PoincareE cuts for numerical methods, one orbit each; explicit Euler (lefb, implicit 
Euler (righb). Same initial data as in Fig. 3.2 











di 三 一 4q1 一 2q192， 如 = 一 0 一 9 十 02 (3.3) 


has solutions with nontrivial properties. For given initial values with 妃 (po, 9qo) < 和 
and do inside the triangle L 乞 二 the solution stays there and moves Somehow like 
amass point gliding on this Surface (See Fig. 3.1, right). 


Poincare Cuts， We fx first the energy 刺 0o and put dlo = 0. Then for any point 
古 = (qzo,pao), we obtain plo from (3.1) as plio = V2Fo -2U0 - p20, where we 
choose the positive root. We then follow the solution until it hits again the Surface 
dl 一 0inthe positive direction pl > 0 and obtain a point 刀 = (q21,p21); in the 
Same way We compnute 媚 = (qdq22,D22j,etc. For the same initial values as in Fig.3.1 
and with 万 0 一 古 ， the solution for 0 入 土生 300 000 gives 46 865 Poincare cuts 
which are all displayed in Fig.3.2 (leftb). They seem to lie exactly on a curve, as do 
the orbits for 5 other choices of initial values. This picture thus Shows“convincing 
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4 (4 
global error 交 
expl. Euler, 几 三 .0001 









AS AM 


/ 
人 心 V SS_sympl Euler 刀 = .0001 












Stormer-Verlet, 下 一 .005 








1 | 
100 ， 200 


万 = 革 


王 


Fig. 3.4. Global error of numerical methods for nearly quasiperiodic and for chaotic solutions; 
Same initial data as in Fig.3.2 


evidence”for the existence of a second invariant, for whlich Gustavson (1966) has 
derived a formal expanslion, whose first termas represent perfectly these cUrveS. 

“But here comes the surprise”(HeEnon-Heiles, p. 76): Fig.3.2 shows to the right 
the same picture in the (qz,Dp2) plane for a somewhat higher Energy 瓦 = 翅 The 
motion turns completely to chaos and all hope for a second invariant disappears. 
Actually, Gustavson 's Series does not converge. 


Numerical 了 xperiments， We now apply numerical methods, the expliicit Euler 
method to the low energy initial values 互 三 古 (Fig.3.3， lefb, and the zzPLzctt 
Euler method to the high energy initial values (Fig.3.3, right), both methods with a 
very Small step size 妃 一 10-5. As wealready expect from our previous experiences， 
the explicit Euler method tends to increase the energy and turns order into chaos， 
while the implicit Euler method tends to decrease it and turns chaos into order. The 
Stormer-Verlet method (not Shown) behaves as the exact Solution even for step SiZzes 
as large as 九 一 10-1. 

In our next experiment we Study the glopal error (See Fig.3.4), once for the case 
of the nearly quasiperiodic orbit (万 三 十 ) and once for the chaotic one ( 互 = 二 )， 
both for the explicit Euler, the Symplectic Euler and the Stormer-Verlet Scheme， 
It may come as a SuUrprise, that only in the first case we have the Same behaviour 
(inear or quadratic growth) as in Fig.2.3 forthe Kepler problem. In the second case 
(五 = 二 ) the global error grows exponentially for all methods, and the explicit Euler 
method is worst. 


Study of a Mapping. The passage from a point 已 to the next one 丈 +1 (as exX- 
plained for the left picture of Fig.3.2) can be considered as a 11apPp118g 允 : 已 一 
万 +1andthe Sequence ofpoints 站 ,万 , 己 , .arejusttheiterates ofthis mapping. 
This mapping ls represented for the two energy levels 刀 三 十 and 克 三 和 in 
Fig.3.3 and its study allows to better understand the behaviour of the orbits. We See 
no Significant difference between the two cases, Simply for larger 成 the deforma- 


tions are more violent and correspond to larger elgenvalues of the Jacobian of 02. In 
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4 一 focus-type fxed point ) 一 Saddle-type fixed point 


Fig. 3.S. The PoincarE map: Po 一 琅 forthe Hhnon-Heiles Model 


both cases we have seven fixed points, which correspond to periodic solutions of the 
System (3.3). Four of them are stable and lie inside the white islands of Fig. 3.2. 


I.4 Molecular Dynamics 


We do not need exact classical trajectories to do this, but must lay great 
emphasis on energy conservation as being of primary importance for this 
Teason. (M.P Allen 和 D.J.Tildesley 1987) 
Molecular dynamics requires the solution of Hamiltonian Systems (1.10), where the 
total energy is given by 
N 1 


V 
瑟 (2,9) = oil-o， (4.D 


1 一 1 1 一 2 7 一 1 
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and Vij(r) are given potential functions. Here, qi and Pi denote the positions and 
momenta of atoms and 7 ls the atomic mass of the ?th atom. We remark that the 
outer solar System (2.12) is such an JV-body system with Vij(7) 三 一 GDi7702j 7. In 
molecular dynamics the Lennard-Jones potential 


帮 () = 4 (() - ( 宇 ) ) (4 


is Very popular (si and ai are Suit- 
able constants depending on the atoms). 
This potential has an absolute minimum 
at distancer 一 ai 如 2. The force dueto 
this potential strongly repels the atoms 
when they are closer than this value， 
and they attract each other when they 
are farther away. 





Lennard - Jones 

















Numerical Experiments with a Frozen Argon Crys- OO) 
tal. As in Biesiadecki 儿 Skeel (1993) we consider the O) G) 
interaction of seven argon atoms in a plane, where SIX of 

them are arranged Symmetrically around a centre atom. CD) 

As amathematical model we take the Hamiltonian (4.1) (9) 人 
with N 一 7, 一 凤 二 66.34.10-27 [kg]， G) 











E 订 二 E 一 119.8 太 加 |， 0i17 二 0 王 0.341 [nmj， 


where jp = 1.380658.10-23 [J/K] is Boltzmann's constant (see Allen 久 Tildesley 
(1987), page 21). As units for our calculations we take masses in [kgl, distances in 
nanometers (1 [nm] = 10-? [ml), and times in nanoseconds (1 [nsec] = 10-9 [sec]). 
Initial positions (in [nm]) and initial velocities (in [nmy/nsec]) are given in Table 4.1. 
They are chosen Such that neighbouring atoms have a distance that ls close to the 
one with lowest potential energy, and such that the total momentum is zero and 
therefore the centre of gravity does not move. The energy at the initial position ls 
妃 (po,dqo) 之 一 1260.2KP 站 . 

For computations in molecular dynamics one ls usually not interested in the tra- 
jectories of the atoms, but one aims at macroscopic quantities Such as temperature， 
pressure, Internal energy, etc. Here we consider the total energy, given by the 了 amil- 
tonian, and the temperature which can be calculated from the formula (See Allen 多 


Table 4.1. Initial values for the Simulation of a frozen argon crystal 














atom 1 兄 所 : 4 加 6 7 
本 0.00 0.07 0.34 0.36 二 0.02 二 0.35 二 0.3I 
Feee 0500 0.39 0.17 一 0.21 一 0.40 一 0.16 0.21 
人 二 30 50 二 70 50 80 二 30 二 80 
| 二 220 一 90 一 60 40 90 100 一 60 
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explicit Euler, 尹 三 0.5 [fsec|] 30[ Verlet, 尹 = 40 [fsec] 








30 上 
_30F 
0 
[ Symplectic Euler, 见 = 10 [fsec] 30[ Verlet 尹 王 80 [fsec] 
_30[ [ 
0 


total energy 由 total energy 


explicit Euler, 见 = 10 [fsec] [ Verlet, 万 二 10 [fsec] 








上 Verlet 刀 三 20 [fsec] 
Symplectic Euler, 尺 = 10 [fsec] 


[temperature ts 


Fig. 4.1. Computed total energy and temperature of the argon crystal 














Tildesley (1987), page 40) 


V 
1 
人 一 一 dz||2. 4.3 
5 | 伸 (4.3) 


We apply the explicit and symplectic Euler methods and also the Verlet method 
to this problem. Observe that for a Hamiltonian Such as (4.1) all three methods 
are explicit, and all of them need only one force evaluation per integration Step. In 
Fig.4.1 we present the numerical results of our experiments. The integrations are 
done over an interval of length 0.2 [nsecj. The step Sizes are indicated in femtosec- 
onds (1 [fsec] = 10-5 [nsec])， 

The two upper pictures show the values ( 忌 (pn, dan) 一 (20， do0)) 六 KB as a func- 
tion oftime 妃 三 1. Forthe exact solution, this value is precisely zero for all times. 
Similar to earlier experiments we see that the Symplectic Euler method is qualita- 
tively correct, whereas the numerical solution of the explicit Euler method, although 
computed with a much smajller step Size, 1S completely useless (See the citation at 
the beginning of this section). The Verlet method is qualitatively correct and gives 
much more accurate results than the Symplectic Euler method (we shall see jlater 
that the Verlet method is of order 2). The two computations with the Yerlet method 
Show that the energy error decreases by a factor of4 半 the step Size ls reduced by a 
factor of 2 (Second order convergence). 

The two lower pictures of Fig.4.1 show the numerical values of the temperature 
difference 下 一 7 with 并 given by (4.3) and 70 盖 22.72 [ 玉 ] (initial temperature). 
In contrast to the total energy, this is not an exact invariant, but for our problem it 
fuctuates around a constant value. The explicit Euler method gives wrong results， 
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but the symplectic Euler and the Verlet methods show the desired behaviour. This 
time a reduction of the step Size does not reduce the amplitude of the oscillations， 
which indicates that the fuctuation of the exact temperature is of the Same Size. 


Is Highly Oscillatory Problems 


In this Section we discuss a System with almost-harmonic high-frequency oscilla- 
tions. We Show numerical phenomena of methods applied with step Sizes that are 
not Small compared to the period of the fastest oscillations. 


I.S.1 A Fermi-Pasta-Ulam Problem 


...dealing with the behavior of certain nonlinear physical systems where 
the non-linearity is introduced as aperturbation to aprimarily linear prob- 
lem. The behavior ofthe systems is to be studied for times which are long 
compared to the characteristic periods of the corresponding linear prob- 
lems. ( 卫 . Fermil, J. Pasta, 8. Ulam 1959) 


In the early 1950s MANIAC-I had just been completed and Sat poised 
for an attack on significant problems. … Fermi suggested that it would 
be highly instructive to integrate the equations of motion numerically for 
a judiciously chosen，one-dimensional, harmonic chain of mass points 
weakly perturbed by nonlinear forces. (J. Ford 1992) 


The problem of Fermi, Pasta 久 Ulam (1955) is a simple model for Simulations in 
statistical mechanics which revealed highly unexpected dynamical behaviour. We 
consider a modification consisting of a chain of 2 mass points, connected with al- 
ternating soft nonlinear and stiff linear Springs, and fxed at the end points (See Gal- 
ganli, Giorsgilli, Martinoli & Vanzini (1992) and Fig.3.1). The variables q1,. .92m 





Stif 人 Soft 
harmonlc nonlinear 


Fig. $.1. Chain with alternating soft nonlinear and stiff linear Springs 


(ao = q2m+l = 0) stand for the displacements of the mass points, and Di; 三 di for 
their velocities. The motion ls described by a Hamiltonian System with total energy 


7 2 77 77 
工 CW 
五 (D, 9) 三 7 >》 (2 十 D2i) 机 本 >》 (gz 一 02i-1 六 十 》 (gil 一 02i) 
记 1 i=1 1 一 0 


Where w ls assumed to be large. It is quite natural to introduce the new variables 
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0 





Fig. S$.2. Exchange of energy in the exact Solution of the Fermi-Pasta-Ulam model. The 
Picture to the right is an enlargement of the narrow rectangle in the left-hand picture 


Y0i 一 (az 十 02i_1)/V2， Y1 一 (2 一 02i1)/V2， 
0 一 (pz 十 Dai_1)/V2， V1 二 (pz2i -poi-1)/V2， 


Where Zoji ( 三 1,...,70) Tepresents a Scaled displacement of the ?th Stifft Spring， 
21i a Scaled expansion (Or compression) of the ;th stiff Spring, and yo V1; their 
velocities (or momenta). With this change of coordinates, the motion in the new 
variables ls again described by a 甩 amiltonian System, with 


(3.1) 





TV 人 777 
1 W 1 
五 (2) 王 了 | 十 人 十 可 Z1 十 il((eoa 一 241) 十 
4 一 1 1 一 1 
7 一 工 





4 
十 二 (zoil 一 01,iH1 一 201 一 1 十 (Zoum 十 下 
证 1 


(3.2) 
Besides the factthatthe equations ofmotion are Hamiltonian, So thatthe total energy 
is exactly conserved, they have a further interesting feature. Let 


| 
T(za7) = (0 十 wz15) 区 3 


denote the energy of the 7th stiff Spring. It turns out that there is an exchange of 
energy between the Stifft springs, but the total oscillatory energy 了 工 王 六 十 ... 十 
Tr Temains close to a constant value, in fact T((z(,y() = T((z(0),y(O)) 
O(w-I). For an illustration of this property，we choose mm = 3 (as in Fig.5.1)， 
w 王 50， 





| 


201(0) =1， yi(0) =1，2z11(00) =w 1(0) =1， 


and zero for the remaining initial values. Fig.3.2 displaygs the energies 嫉 ,72,73 
of the stiff Springs together with the total oscillatory energy 了 三 站 十 72 十 13asa 
function oftime. The solution has been computed very carefully with high accuracy， 
So that the displayed oscillations can be considered as exact. 
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I.S.2 Application of Classical Integrators 


Which of the methods of the foregoing sections produce quajitatively correct ap- 
proximations when the product of the step Size 下 with the high frequency w is rela- 
tively large?7 


Linear Stability Analysis. To get an idea of the maximum admissible Step Size， 
weneglect the quartic term in the Hamiltonian (5.2), So that the differential equation 
Spjlits into the two-dimensional problems yo 三 0, Zoji 三 yoi and 


0 一 一 wo2z1i， 2 一 全 (5.4) 


Omitting the Subscripts, the solution of (3.4) 1s 


CR) 


The numerical solution of a one-step method applied to (3.4) yields 


Vm 十 1 2 2 
(| 一 M(jpu) (3.5) 


and the eigenvalues Xi of M(hw) determine the long-time behaviour of the numeri- 
cal Solution. stability (il.e., boundedness of the solution of (3.3)) requlires the eligen- 
values to be less than or equal to one in modulus. For the explicit Euler method 
we have Al2 一 工 士 1po, so that the energy 刀 一 (2 十 w2z2)/2 increases as 
(1 十 ji2w2)"/2. For the implicit Euler method we have Xi = (1 十 1pu)-l and 
the energy decreases as (1 十 12w2)-"/2. For the implicit midpoint rule, the ma- 
trix MT(Aw) is orthogonal and therefore 7 is exactly preserved for al 尹 and for all 
times. Finally, for the symplectic Euler method and for the Stormer-Verlet Scheme 
we have 


1 三 雇员 1 尹 22 几 w 1 万 2w2 
Maw= (已 | ww- 人 忆 2 ) ) 


CU 玉 2w2 
交 1 一 芝 











Tespectively. For both matrices, the characteristic polynomial is X 一 (2 一 jw 入 二 1， 
So that the eigenvalues are of modulus one if and only 庄 |Pw| < 2. 


Numerical Experiments，We apply several methods to the Fermi-Pasta-Ulam 
(FEPU) problem,， with w = 50 and initial data as given in Sect.I.5.1. The explicit 
and implicit Euler methods give completely wrong solutions even for very Small 
Step SlzeS. Fig.3.3 presents the numerical results for 妃 , 7 , 12, 73s obtained with 
the implicit midpoint rule, the Symplectic Euler and the Stormer-Verlet Scheme. 
For the small step size 玫 三 0.001 all methods give satisfactory results, although the 
energy exchange is not feproduced accurately over long titmes. The Hamiltonian 瓦 
and the total oscillatory energy 7 are well conserved over much longer time inter- 
vals. The larger Step Size 几 = 0.03 has been chosen Such that jw = 1.5 is close 
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Fig. $.3. Numerical solution for the FPU problem (3.2) with data as in Sect.I.3.1, obtained 
with the implicit midpoint rule (lefb, symplectic Euler (middle), and Stormer-Verlet Scheme 
(righb; the upper pictures use 玉 三 0.001, the lower pictures 妨 三 0.03; the first four pictures 
Show the Hamiltonian 互 一 0.8 and the oscillatory energies 厂 , 172, 73,7 the last two pictures 
only Show 12 and 


to the Stability limit of the Symplectic Euler and the Stormer-VYerlet methods. The 
values of 互 and y are Still bounded over very long time intervals, but the oscillations 
do not Tepresent the true behaviour. Moreover the average value of 7 ls no longer 
close to l, as it ls for the exact Solution. These phenomena call for an explanation， 
and for numerical methods with an improved behaviour (See Chap.XIID). 


I.6 了 Xercises 


1. Show that the Lotka-Volterra problem (1.1) in logarithmic scale, ie., by putting 
2D= 三 logandd=]logv,becomesaHamiltonian system with the function (1.4) 
as Hamiltonian (See Fig.06.1). 


fow in log. Scale 











Fig. 6.1. Area preservation in logarithmic scale of the Lotka-Volterra fow 
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2. Apply the symplectic Euler method (or the implicit midpoint rule) to problems 


SUch as 
的 (AR 


with various injtial conditions. Both problems have the Same first integral (1.4) 
as the Lotka-Volterra problem and therefore their solutions are also periodic. 
Do the numerical solutions also Show this behaviouTr” 

3. A general two-body problem (Sun and planeb is given by the Hamiltonian 


CAT 
le 一 gs 由 





1 1 
万 (p,ps,9q,ds) = 0 站 2Z7D 


where qs,q E 月 3 are the positions of the sun (mass AM) and the planet (mass 
7), Ds,D E 了 3 are their momenta, and G is the gravitational constant. 
a) Prove: in heliocentric coordinates CO := 一 qs,the edquations of motion are 





可 QQ 

We 
b) Prove that 生 (@ 人 雪 x Q 人 ) 一 0, so that @ 人 stays for all times ti in the 
plane 瑟 = {a;i dzd=0} whered=Q(0) xQ(0). 
Co1clusio11. The coordinates corresponding to a basis in 瓦 Satisfy the two- 
dimensional equations (2.2). 

4. In polar coordinates, the two-body problem (2.2) becomes 


六 二 一 芭 (站 with VY(r) = 一 一 一 


which is independent of po. The angle (tb can be obtained by simple integration 
from 2 人 = Do]r2 人. 
5. Compnute the period of the solution of the Kepler problem (2.2) and deduce 
from the result Keplers “third law”. 
ZL Comparing 天 eplers Second law (2.0) with the area of the ellipse gives 
地 Lo7 = abr. Then apply (2.7). The resultis 亿 一 27r(2| 瑟 0|) -3/2 一 2ra3/2. 
606. Deduce Kepler s first law from (2.2) by the elegant method of Laplace (1799). 
ZL Multiplying (2.2) with (2.3) gives 


0 (的 )， 和 一 瑟 (全 
and after integration 7odl = 至 十 也 ,7o0a = 一 全 十 4 where 4and 忆 are 
integration constants. Then eliminate dl and dz by multiplying these equations 
by da2 and 一 dl respectively and by Subtracting them. The result is a quadratic 
edquation in ql and aq2. 

7. Whatever the initial values for the Kepler problem are, [1 十 2 石 0 并 1 过 0 holds. 
Hence, the value e ls well defined by (2.9). 
矶 AL Lo is the area of the parallelogram spanned by the vectors 9q(0) and 9(0). 
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10. 


11. 


12. 


I_ Examples and Numerical Experiments 


.VDle11ae1l1ation or ipe S1OFm11e/ 一 Weref schne111e. Explain why the use of the one- 


step formulation (1.17) 1 numerically more Stable than that of the twor-term 
recursion (1.15). 


. 有 RU18e-Le1nz-Paxli vector Prove that the function 


D1 0 1 d1 
4(p,9g) 三 | pa |x 0 一 一 一 2 


d 
/有 
0 d1D2 一 942D1 9 十 9 \0 


is a first integral of the Kepler problem, ie.,， 4(p(b,d( 执 ) = Const along 
Solutions of the problem. However it 1 not a first integral of the perturpbed 
Kepler problem of Exercise 12. 

Add a column to Table 2.1 which shows the long-time behaviour of the error in 
the Runge-Lenz-Pauli vector (See Exercise 9) for the various numerical inte- 
gratorS. 

For the Kepler problem, eliminate (pl1, pz2) from the relations 万 (P,d) 王 Comst， 
了 (D,9) = Const and 4(p,9q) = Const. This gives a quadratic relation for 
(qi,dq2) and proves that the Solution lies on an ellipse, a parabola, or on a hy- 
perbola. 

Study numerically the solution of the perturbed Kepler problem with Hamil- 
tonian 


_ 工 /2 2 1 此 
0 


where /LI a positive or negative Small num- 








ber. Among others, this problem describes 一 0 
the motion of a planet in the Schwarzschild 

potential for Einstein's general relativity the- | SN 
ory7. You will observe a precession of the 世 
perihelion, which, appliedto the orbit of Mer- 1 


CuUIy, Tepresented the historically first verif- 
cation of Einstein's theory (See e.g., Birkhof 仁 
1923,p.201-204). 





The precession can also be expressed analytically: the equation forv = 1/r as 
afunction of 2, corresponding to (2.8), here becomes 


1 
十 也 一 本 十 Nu， (0.1) 


where d = 71. Now compnute the derivative of this solution with respect to 1 
at 王 0andv=(1+ecos(p 一 PP))/dafter one periodt = 27r. This leads to 
刀 一 NU(e/d2) :27rsin op (see the small picture). Then, for small /u, the precession 
after one period is 

2T 人 


人 
7 一 


(0.2) 


7” We are grateful to Prof. Ruth Durrer for helpful hints about this subject. 


Chapter 工 . 
Numerical Integrators 


After having seen in Chap.I Some Simple numerical methods and a variety of nu- 
merical phenomena that they exhibited,， we now present more elaborate classes of 
numerical methods. We start with Runge-Kutta and collocation methods, and we 
introduce discontinuous collocation methods, which cover essentially all high-order 
implicit Runge-Kutta methods of interest, We then treat partitioned Runge-Kutta 
methods and Nystrom methods, which can be applied to partitioned problems such 
as Hamiltonian Systems. Finally we present composition and splitting methods. 


II.1 Runge-Kutta and Collocation Methods 





Fig. 1.1. Carl David Tolme Runge (left picture), born: 30 August 1856 in Bremen (Germany)， 
died: 3 January 1927 in Gottingen (Germany). 

Wilhelm Martin Kutta (right picture), born: 3 November 1867 in Pitschen, Upper Silesia (now 
Byczyna, Poland), died: 25 December 1944 in Furstenfeldbruck (Germany) 


Runge-Kutta methods form an Important class of methods for the integration of 
differential equations. A Special Subclass, the collocation methods, allows for a par- 
ticularly elegant access to order, Symplecticity and continuous output. 
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II.1.1 Runge-Kutta Methods 


In this section, we treat non-autonomous Systems of first-order ordinary differential 
equations 


= 上坟 幼 ， yto) = V0. (1.1D) 


The integration of this equation gives y( 妇 ) = yo 十 上 jb) at andreplacing 
the integral by the trapezolidal rule, we obtain 


太一 加 十 二 (tooo) + Fn 扩 )， (1.2) 


This ls the zzPlicil 1rapezoidal Fe，which，in addition to its historical impor- 
tance for computations in partial differential equations (Crank-Nicolson) and in 
A-stability theory (Dahlquist), Played a crucial role even earlier in the discovery of 
Runge-Kutta methods. It was the Starting point of Runge (1895), who “predicted” 
the unknown y1-value to the right by an Euler step, and obtained the first of the 
following formulas (the second being the analogous formula for the midpoint rule) 


ji = to,yo) ji 三 帮 to,yo) 
pa = 帮 to 十 几 Vo 十 几 pi) ja = to 十 全 ,90 十 生 1) (1.3) 
册 二 加 十 和 (1 十 ja) 1L 三 Vo 十 Po2. 


These methods have a nice geometric interpretation (which is illustrated in the first 
two pictures of Fig.1.2 for a famous problem, the Riccati equation): they consist 
of polygonal lines, which assume the Slopes prescribed by the differential equation 
evaluated at previous points. 


aea of Fevmn (7900) da1d Kxtta (79071 六 compute seyveral polygonal lines, each start- 
ing at yo and assuming the various Slopes ji; on portions of the integration interval， 
which are proportional to some given constants aij; at the final point of each poly- 
gon evaluate a new Slope 1 态 . The last of these polygons，with constants 避 ，deter- 
mines the numerical solution V1 (See the third picture of Fig. 1.2). This idea leads to 
the class of expiici Runge-Kutta methods, ji.e., formula (1.4) below with ai = 0 
forz < 7. 


expl. midp. ruley/ 





V1 




















1 
2 世 1 


Mb 
c 


Fig. 1.2. Runge-Kutta methods for 刀 王女 十 轨 2,oyo =0.46, 彤 = 1; dotted: exact solution 
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Much more important for our purpose are 17Plict Runge-Kutta methods, intro- 
duced mainly in the work of Butcher (1963). 


Definition 1.1. Let Di aij (7 三 1 二. .,5) bereal numbers and let ci 一 | 局 
An s-81aSge RU118e-KU1ta 11etpod is given by 


语 一 j( 旨 + ci 加 二 天》 aij 阿 )， 和 

(1.4) 
凡 一 加 十 凡 》 记 有 

1 一 1 


Here weallow afull matrix (ai7) ofnon-zero coefficients. In this case, the slopes 
ji can no longer be computed explicitly, and even do not necesSsarily exist. For ex- 
ample, for the problem set-up of Fig. 1.2 the implicit trapezolidal rule has no Solu- 
tion. However, the implicit function theorem assures that, for sufficiently Small 尺 ， 
the nonlinear System (1.4) for the values 1,...,Ks。has a locally unique solution 
close to i 忆 FUto,V0). 

Since Butcher's work, the coefficients are usually displayed as follows: 





ClL | QIl1 . .0Q1s 
: (1.5) 

Cs | Qs1 CQss 

| 访 1 


Definition 1.2. A Runge-Kutta method (or a general one-step method) has o7der D， 
过 for all sufficiently regular problems (1.1) the /1ocal error 册 一 V(to 十 尹 ) satisfies 


册 一 V( 加 十 癌 三 Of as 天 一 0. 


To check the order of a Runge Kutta method，one has to compnute the Taylor 
Series expansions of y(to 十 确 ) and Vi around to 玫 = 0. This leads to the following 
algebraic conditions for the coefficients for orders 1, 2, and 3: 


> 有一 1 for order 1; 

in addition > poici 一 1/2 for order 2; 
in addition > ,5cz 一 1/3 

and 注 六 5 baijcj 三 1/6 for order 3. 


(1.0) 


For higher orders, however, this problem represented a great challenge in the first 
half of the 20th century. We Shall present an elegant theory in Sect.IUI.1 which 
allows order conditions to be derived. 

Among the methods seen up to now, the explicit and implicit Euler methods 


0 1 11 
人 (1.7) 
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are of order l, the implicit trapezoidal and midpoint rules as well as both methods 
of Runge 


0 0 0 
1 | 1/2 1/2 1/2 | 1/2 1| 1 1/2 | 1/2 
| 1/2 17/2 | 1 | 1/2 172 | 0 1 





are of order 2. The most Successful methods during more than half a century were 
the 4th order methods of Kutta: 








0 0 
1/2 | 1/2 1/3 | 1/3 
LI/2| 0 1/2 2/3 | -1/3 1 (1.8) 
1|0 0 1 1 1 = 人 下 
| 1/6 2/6 2/6 1/6 | 18 3/8 3/8 1/8 


II.1.2 Collocation Methods 


The high speed computing machines make it possible to enjoy the advan- 
tages of intricate methods. (PC. Hammer 上 放 J.W. Hollingsworth 1955) 


Collocation methods for ordinary differential equa- 

tions have their origin，once again, in the implicit 

trapezoidal rule (1.2): Hammer 贸 Hollingsworth 2 
(1953) discovered that this method can be interpreted [7 
as being generated by a guUadratic fmctionm “which 

agrees in direction with that indicated by the differen- 7 

tial equation at two points” to and 六 (See the picture | | 
to the righb. This idea allows one to“see much-used 如 to 十 岂 /2 刀 
methods in a new light” and allows various general- 

izations (Guillou 儿 SoulE (1969), Wiight (1970)). An interesting feature of collo- 
cation methods is that we not only get a discrete Set of approximations, but also a 
CO11I1UOVUS GDP7oOXI1G1io1t to the Solution. 


工 





Definition 1.3. Let cl,...,cs be distinct real numbers (usually 0 < ci 忒 1). The 
colliocation polymo7aialu(t is apolynomial of degree s Satisfying 


utto) 一 90 
. ， (1.9) 
t(to 十 ci 万) ES Fio+ciju(to 二 ci 站))， 和 5 
and the numerical solution of the coliocatiom maetjpod is defined by Vi 三 WU(to 十 刀 ). 
For s = 1, the polynomial has to be of the form (人 = yo 十 人 一 如 )5 with 
谨 三 J(to 十 C1 几 ， 20 十 cl1)， 


We see that the explicit and implicit Euler methods and the midpoint rule are collo- 
cation methods with cl = 0, cl = 1 and cl = 1/2, respectively. 
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”impl.trap. rule impl. midp. rule GausS4 




















MD 
一 





























Fig. 1.3. Collocation solutions for the Lotka-Volterra problem (IL.1.1); wo 三 0.2, vo 三 3.3; 
methods of order 2: four steps with 玉 = 0.4; method of order 4: two steps with 尺 二 0.8; 
dotted: exact Solution 


FEors=2andecl = 0cz = 1 wefnd of 一 一 一 
course, the implicit trapezolidal rule. The choice of ES 
了 Hammer 狠 Hollingsworth for the collocation points 
is cl 王 1/2 士 V3/6， the Cazxssia1 gUGG1atz1e nodey 2 
(See the picture to the righb. We will see that the cor- ! 1 
ITesponding method is of order 4. t0 1 十 ci 太 加 十 cz 太吉 

In Fig.1.3 we illustrate the collocation idea with 
these methods for the Lotka-Volterra problem (I.1.1). One can observe that, in Spite 
of the extremely larsge step Sizes, the methods are qulte satisfactory. 








Theorem 1.4 (Guillou & Soule 1969，Wright 1970). 7jpe co1ocatiom 71etpod of 
Dejizaiztio1 1.3 18 eqgUivalent io 1 加 e 5s-S1a8e RU118e-KUIG 111etpod (7 和 wii coe 扩 - 
Cie1118 


aij 二 由 bj(T)da7， ， 访 二 aa adT， (1.10) 
0 
Where Li(T) 1 tphe La810118e Po1Dynio1aial bitT) 三 [LiTr 一 co) 人 ci 一 c0). 
Poof Letv(b be the collocation polynomial and define 
1 :一 人 (to 十 ci 帮 )， 
By the Lagrange interpolation formula we haveu(to 十 T7) 一 27-1 1 iT)， and 
by integration we get 


u(to 十 ci 万 ) -+ny 石 / bi(7T) d7. 
1 


JInserted into (1.9) this gives the first formula of the Runge-Kutta equation (1.4). 
JIntegration from 0 to 1 yields the Second one. 
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The above proof can also be read in reverse order. This Shows that a Runge 一 
Kutta method with coeffticients given by (1.10) can be interpreted as a collocation 
method. Since 六 一半 六 -1 cf bj(7) for 开 = 1,...，,s, the relations (1.10) are 
equivalent to the linear Systems 


C(9) >》 ,aijcp 1 下 一 1 ) 9 all ; 

7 (1.11) 
BOD): >》 jc 一 工 天 一 1 

于 


with 一 sandDp=5s. Whatis the order of aRunge-Kkutta method whose coeff- 
cients 0i, aij are determined in this way? 

Compared to the enormous difficulties that the first explorers had in constructing 
Runge-Kutta methods of orders 5 and 6, and also compared to the difficult algebraic 
proofs of the first papers of Butcher the following general theorem and its prootf， 
discovered in this form by Guillou 儿 SoulE (1969), are SuUrprisingly Simple. 


Theorem 1.S (Superconvyergence). 矿 力 e cozmdifion 已 (D) jpol1ds Jor someD > 8 
加 e1l 1 力 e coliocatio1z 11etpod (De1jzzazition 71.3) 1as orderD. 7T1is 1ea128 tat 1 加 e collio- 
CQ1iO1 111etpod jas 1je Sa111e o1der ds 加 e Mideriyi1ag quUad1att1e rm2Ula. 


Pioof We consider the collocation polynomial v (加 as the solution of a perturbed 
differential equation 

色 一 大 太 十 5 区 (1.12) 
with defect 6(b := 了 习 轨 一 多 ). Subtracting (1.D from (1.12) we get after 
linearization that 


za0-30= 芝 (sj(wO-yD)+a0+rO GD 


Where, for 如 区 二 近 加 十 九 the remainderr(b is of size O(uw 人 的 一 y 的 上) = 
O(P2s+2) byLemma 1.6 below. The variation of constants formula (see e.g., Hairer, 
Ngwrsett & Wanner (1993), p. 060) then yields 
to 十 
太一 gto+ 间 一 Mto+ 用 -gto+ 问 一 人 BRGto+ 记 s(5(s)+r(s)) gs (19) 
to 
Where 民 (t s) is the resolvent of the homogeneous part of the differential equa- 
tion (1.13)， ie.，the solution of the matrix differential equation 0O 忆 (人 太 s)/6t = 
4 人民 ( 态 s), 民 (ss) 三 志 with 4 的 = DJ/6y( 信 V). The integral over 民 ( 如 十 
几 , s)7r(s) gives a O(h2s+3) contribution. The main idea now is to apply the quadra- 
ture formula (bci)2_1 to the integral over qg(5) = 民 (to 十 几 s)0(s); because the 
defect 0(s) vanishes at the collocation points 如 十 ci for = 1 ...，,s, this gives 
Zero as the numerical result. Thus, the integral is equal to the quadrature error which 
is bounded by Ph2+l times a bound of the pth derivative of the function 9g(s). This 
derivative is bounded independently of 凡 ，because by Lemma 1.6 all derivatives 
of the collocation polynomial are bounded uniformly as 六 一 0. Since,， anyway， 
D<2s,wegetW 一 yt 十 P=O(An2+l) from (1.14). 
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Lemma 1.6. 77jpe coliocatiomn Polymommial zu(b 六 aa apPjoxiniation oforder sto te 
EXGC1 SOUUIO Of (1 o1 te wpole Iterva1 Le,， 


lu 昌 一 VE< CR Jor teftot 十 败 (1.15) 


G11G Jr Siciemt1iy S12G1 邦 . 
Moreoyver tpe deriyativesy ofu( 人 bsatisPJjorte ltot 如 十 咱 


la 一 WOD< CstR Jor 大 一 0 .5. 
Prooj The collocation polynomial Satisfies 
ato 十 Th = S j(+caltto+c 站 6， 
让 1 
while the exact solution of (1.1) satisfies 
yto 十 T7) 一 人 ja 十 cij,y(to 十 ci Li(T) 十 大 羽 (T, 用 )， 
i 一 1 
Where the interpolation error 瓦 (7, 刀 ) is bounded by maxtefto ,to+ 阁 ys+D (bs 


and its derivatives Satisfy 


6 
| 人 有 | 人 icE (s 一 上 十 1)! 


This follows from the fact that, by Rolle's theorem, the differentiated polynomial 
> jito +cijy(to 二 ci) Le D(7) can be interpreted as the interpolation 
polynomial of it-1yCe(to 十 Frj) at s 一 大 十 1points lying in [to,t 如 十 川 . Integrating 
the difference of the above two equations gives 


V(to 十 7 由) 一 &(t 如 十 TP) 王 六 >》， 人 广 li(a)dc 十 Ps 人 五 (a,]) dc (1.10) 
5 0 0 
With 人 广 一 丰 (t 十 Ci 用 ， 2(t 十 ci 由)) j(t 十 Ci 用 ， ul(to 十 ci 站 )) Using a Lipschitz 
condition for Fit y), this relation yields 


max |y 扩 一 坟 | 和 ARC max yy 的 一 坟 | 二 Comst ， js 
tcf[to,to 十 咱 tcE[to,to 十 阁 


implying the statement (1.15) for sufficiently small 闵 > 0. 
The proof of the second statement follows from 


及 (go+7 一 uto+Tr 间 ) = 天》 4 肠 可 (二 D(r 关 
?一 | 














by using a Lipschitz condition for j 丰 上 yy) and the estimate (1.19)， 
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II.1.3 Gauss and Lobatto Collocation 


Gauss Methods. If we take cl,，.. 
polynomial 


.),Cs as the zeros of the sth Shifted Legendre 


(CE 二 下) ) 

the interpolatory quadrature formula has order Dp 三 2s, and by Theorem 1.3, the 
Runge-Kutta (or collocation) method based on these nodes has the Same order 25. 
For s = 1 we obtain the implicit midpoint rule. The Runge-Kutta coefficients for 
s 一 2(the method of Hammer 狠 Hollingsworth 19353) and s = 3are given in 
Table 1.1. The proof of the order properties for general s was a Sensational result of 
Butcher (1964a). At that time these methods were considered, at least by the editors 
of Matj. of Comapxt, to be purely academic without any Practical value; 3 years 
later their 4-stabjlity was discovered, 12 years later their 已 -stability, and 25 years 
later their Symplecticity. Thus, of all the papers in issue No. 85 of Ma11. of ComaptL， 
the one most Important to us ls the one for which publication was the most difficult. 


Table 1.1. Gauss methods of order 4 and 6 
































1  V3 1 1  V3 
广电 4 4 6 
1 V311 V3 1 
2+ 6 3416 4 
1 上 
2 2 
1  V15 5 V15 5 15 
2 10 36 9 15 36 30 
1 5 ， V15 2 5 15 
号 36 ”24 9 36 24 
1 是 V15|15 V15 2 计 V15 5 
2 ， 10 1136 30 9 15 36 
5 4 5 
18 9 18 





了 adau Methods. Radau quadrature formulas have the highest possible order， 
25 一 1 among quadrature formulas with either cl = 0 or cs 三 1. The correspond- 
ing collocation methods for c。 = 1 are called Radau JIA methods. They Play an 
important role in the integration of stiff differential equations (See Hairer 儿 Wanner 
(1996), Sect. TIV.8). However, they lack both syz11ietry and Sy1iPlecticity, properties 
that will be the subjects of later chapters in this book. 


Lobatto IIUJIA Methods. Lobatto quadrature formulas have the highest possible or- 
der with cl = 0 and c。 = 1. Under these conditions, the nodes must be the zeros 
of 
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as 一 2 
二 (5 (z 一 1 和 ) (17) 


and the quadrature order js 2 = 2s 一 2. The corresponding collocation methods are 
called, for historical reasons, Lobatto IIA methods. For s = 2 we have the implicit 
trapezoidal rule. The coefficients for s 王 3ands 王 4aregivenin Table 1.2. 


Table 1.2. Lobatto IIA methods of order 4 and 6 




















0 0 0 
1|15 1 1 
2|24 3 24 
1 2 1 
16 3 6 
1 2 1 
6 3 6 
0 0 0 0 0 
5--V5 |11+V5 25 一 V5 25 一 13V5 -1+V5 
10 120 120 120 120 
5+V5 |11-v5 25+13v5 25+V5 -1-V5 
10 120 120 120 120 
1 5 于 1 
12 12 12 12 
1 5 5 1 
12 12 12 12 





II.1.4 Discontinuous Collocation Methods 


Collocation methods allow, as we have Seen above, a very elegant proof of their 
order properties. By Similar ideas, they also admit strikingly simple prootfs for their 
4- and 已 -stabiljity as well as for symplecticity, our subject in Chap. VI. However， 
not all method classes are of collocation type. It is therefore interesting to deftine a 
modification of the collocation idea, which allows us to extend all the above proofs 
to mmuch wider classes of methods. This definition will also lead, later, to Important 
classes of pa1tiLiomed methods，. 


Definition 1.7. Let cz,...,cs_lbe distinctreal 六 


numbers (usually 0 < ci 挟 1]), and let 01,0。 


be two arbitrary real numbers. The correspond- V /ba 
ing disco71iUOUS coliocatiom 1ietjpod is then 2 SS 
defined via a polynomial of degree s 一 2 Sat- 2 
isfying ] 浊 








uto) = 加 一 Pbi 人 (加 ) 一 tou(to))) 
尼 ( 加 十 cij) = t+ciju( 如 十 Cj) 一 2 5 一 了 (1.18) 
1 一 2 三 ) 一 Ps 人 人) 一 Fu 人 5)))， 
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The fgure gives a geometric interpretation of the correction term in the first and 
third formulas of (1.18). The motivation for this deftnition will become clear in the 
proof of Theorem 1.9 below. Our first result shows that discontinuous collocation 
methods are equivalent to implicit Runge-Kutta methods. 


Theorem 1.8. 77Pe discom1inUous coliocafio1z 111ethod of De1izzition 1.718 egUivalet 
1O G1 5-S108e RU118e-KU1ta 111etpod (7.4) WIN coejpcients delermained py cl = 0， 
cs 一 1 a10Q 

ail 一 01， ais 二 0 省 产 人 二 于 5 总 

C(s 一 2) G10 刀 (s 一 2)， 
Witp tpe co1nditionsy C(dq) anad 已 (D) or (7.7 由 


Pioof As in the proof of Theorem 1.4 we put 局 := (to 十 cip) (this time for 
1 一 2,...,5 一 ,sothat (to 十 T7) 一 于 一 1i 六 bi(T) by theLagrange interpolation 
formula. Here, bj(7) corresponds to cz,...,cs_landisapolynomial of degree s 一 3. 
By integration and using the deftinition of w(to) we get 


(1.19) 


放 2 


3 一 | Ci 
= 加 十 ]1i 了 十 尹 》 放 (/ bi(7) dr 一 O(0)) 
7 一 2 所 


with fi = /yo). Inserted into (1.18) this gives the first formula of the Runge-Kutta 
equation (1.4) with aij = 帮 bi(r)dr 一 1p01(0). As for collocation methods, one 
checks that the aij are Uniquely determined by the condition C(s 一 2). The formula 
for Wi is obtained Similarly. 














Table 1.3. Survey of discontinuous collocation methods 























type characteristics prominent examples 
0 一 0, bs 一 0 | (s 一 2)-stage collocation Gauss, Radau IIA, Lobatto IIA 
1 一 0, bs 天 0 | (s 一 -stage with uis 一 0 methods of Butcher (1964b) 
1 天 0, bs 一 0 | (s 一 J)-stage with ail 一 呈 Radau IA, Lobatto IIC 
pb1 天 0,，bs 和 关 0 | s-stage with ail 一 0，ais 一 0 | Lobatto IIB 





开 0 三 0inDeftinition 1.7, the entire first column in the Runge-Kutta tableau 
vanishes, So that the first stage can be removed, which leads to an equivalent method 
with s 一 1 stages. Similarly, 过 ps。 三 0, we can remove the last Stage. Therefore, we 
have all classes of methods, which are “continuous” either to the left, or to the right， 
Or on both Sides, as Special cases in ouUr definition. 

In the case where 0 一 50。 一 0, the discontinuous collocation method (1.18) is 
equivalent to the (s 一 2)-stage collocation method based on cz,...,cs_l1 (see Ta- 
ble 1.3). The methods with b。 = 0 but 0 和 0, which include the Radau IA and 
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Table 1.4. Lobatto IIB methods of order 4 and 6 




















1 -1-VvV5 -1+V5 
交 24 24 
5--V5 | 1 25 十 V5 25 一 13V5 站 
0 人 0 10 12 120 120 
和 于，: 洒 5+TV5 | 1 25+13v5 25 一 V5 
72|6 3 10 12 120 120 
1 5 1 11 一 V5 11 十 V5 
15 5 1 12 24 24 《 
1 2 1 1 5 5 1 
6 3 6 12 12 12 12 








Lobatto IIUC methods, are of interest for the solution of stiff diffterential equations 
(Hairer 人 放 Wanner 1996). The methods with 0 三 0 but bs。 夫 0, introduced by 
Butcher (1964a, 1964b)j, are of historical interest. They were thought to be compu- 
tationally attractive, because their last stage ls explicit. In the context of geometric 
integration, much more important are methods for which both 0 和 0 and bs。 天 0. 


Lobatto JUB Methods (Iable 1.4). We consider the quadrature formulas whose 
nodes are the zeros of (1.17). We have cl = 0 and cs = 1. Based on cz2,... ,cs 1 
and D1,5。we consider the discontinuous collocation method. This class of meth- 
ods is called Lobatto IIB (Ehle 1969), and it plays an Important role in geometric 
integration in conjunction with the Lobatto IIA methods of Sect. HI.1.3 (See Theo- 
remIV.2.3 and Theorem VI.4.3). These methods are of order 25 一 2, as the following 
Tesult Shows. 


Theorem 1.9 (Superconvergence). 77Pe discom1izzUots coliocatio1z 11etpod or De 太 
170111011 7.7 jasy 11e sa111e oO1der ds 1je 11der1ying guUad1atU1e Jo7r72ULa. 


Pioof We follow the lines of the proof of Theorem 1.5. With the polynomial v( 萎 
of Deftinition 1.7, and with the defect 


6 人 := 习 扩 一 bu 人 ) 
we get (1.13) after linearization. The variation of constants formula then yields 


ua(t 如 十 癌 一 yto 十 站 = Ri 十 刀 to)(u(to) 一 %0) 


to 十 用 
六 尺 ( 如 十 太 相 (5 十 rs ds， 
to 

Which corresponds to (1.14) 让 (to) = yo. As a consequence of Lemma 1.10 below 
(with K = 0), the integral over 尽 (to 十 几 s)r(s) gives a O(12s 一 1) contribution， 
Since the defect 6(to 十 cipm) vanishes only fori = 2,...,5 一 1,anapplication ofthe 
quadrature formula to 尺 ( 如 十 几 , s)6(s) yields PPDiRR(t 十 几 t 加 )56( 如 ) 十 Pbso(to 十 用 
in addition to the quadrature erron which is O(P2z+1l). Collecting terms suitably, we 
obtain 
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u 人 (三 ) 一 Pbs6( 三 ) 一 5() 三 已 (加 (ulto) 十 Pi6(to) 一 0) 
+OU ) 二 OU 


which, afterusing the definitions ofw(to) andu(t),proves J1 一 V( 三 ) = O(PP+T) 十 
@ 〇 (12s 一 5)， 














Lemma 1.10. 77Pe po1ymomial (toftpe discomtizauous collocation 1nethod (7.78) 
Satisjfies JorteE [to,to 十 尹 azd Jorsuicientty s12011 忆 


la 一 VOD<C Pr Jor =0. .5 一 2. 


Prooj The proof is essentially the Same as that for Lemma 1.0. In the formulas for 
uto 十 TPR) and yt 十 TP), the sum has to be taken from = 2toi=s 一 1. 
Moreover, all 1s become 1s-2. In (1.16) one has an additional term 


Wo 一 uto) = Pi(alto) 一 tou(to)))， 


which, however is just an interpolation error of size O(js-1) and can be included 
in Comst . ]s 一 . 














JI.2 Partitioned Runge-utta Methods 


Some interesting numerical methods introduced in Chap.I (Symplectic Euler and 
the Stormer-Verlet method) do not belong to the class of Runge-Kutta methods. 
They are Important examples of so-called partitioned Runge-Kutta methods. In this 
Section we consider differential equations in the partitioned form 


= 7 2， 之 一 9(2 2)， (2.1D) 


where yV and z may be vectors of different dimensions. 


JI.2.1 Definition and First Examples 


The idea ls to take two different Runge-Kutta methods, and to treat the V-variables 
with the first method (aij, bi), and the z-Variables with the second method (Qi 05). 


Definition 2.1， Let Dj, ai and 已 ,Qij be the coefficients of two Runge-Kutta meth- 
ods. A_Ppa1titiomned Ru118ge-Kutta 111etpod for the Solution of (2.1) is given by 


一 j (加 十 太 》 oj 用 加-+R 050)， 
7 一 1 JJ=1 


6 g( 加 二 六 》 op 放 ， 2 二 650， (2.2) 
J=1 J=1 
1 王 0 十 刀 》 已 24 一 20 十 彤 》 0 
三 学 


II.2 Partitioned Runge-Kutta Methods 39 


Methods of this type were originally proposed by Hofer in 1976 and by Griepen- 
trog in 1978 for problems with stifft and nonstiff parts (See Hairer NUrsett 久 WanneT 
(1993), Sect. II.13). Their Importance for Hamiltonian Systems (See the examples of 
Chap.J) has been discovered only in the last decade. 

An interesting example is the Symplectic Euler method (1I.1.9)，where the im- 
plicit Euler method 0 = 1,alil = 1 is combined with the explicit Euler method 
信 一 1,011 = 0. The Stormer-Verlet method (IL.1.17) is of the form (2.2) with 
coefficients given in Iable 2.1. 


Table 2.1. Stormer-Verlet as apartitioned Runge-Kutta method 





0| 0 0 1/2 | 1/2 0 
1 |1/2 1/2 1/2 1/2 0 
172 /2 1 IT72 IT727 


The theory of Runge-Kutta methods can be extended in a straightforward man- 
ner to partitioned methods. Since (2.2) is a one-step method (Wi,21) = @p(Vo, 20)， 
the Deftinition 1.2 of the order applies directly. Considering problems 7 三 jy)， 
必 一 9g(z) without any coupling terms, we see that the order of (2.2) cannot exceed 
Imin(D, 奴 , where p and 六 are the orders of the two methods. 


Conditions for Order Two. Expanding the exact solution of (2.1) and the numer- 
ical solution (2.2) into Taylor series，we see that the method is of order 2 夺 the 
coupling conditions 


三世 (2.3) 


are Satisfied in addition to the usual Runge-Kutta order conditions for order 2. The 
method of Table 2.1 satisfies these conditions, and it ls therefore of order 2. We also 
remark that (2.3) is automatically Satisfied by partitioned methods that are based on 
the Same quadrature nodes, ij.e.， 


ci 一 台 for all; (2.4) 


where, as usual，ci 一 aij and Ci 一 > Qi . 


Conditions for Order Three. The conditions for order three already become quite 
complicated, unless (2.4) ls satisfied. In this case，we obtain the additional condi- 
tions 

> 050i7Cy 一 1/6， > oai7C7 一 1/6. (2.3) 
The order conditions for higher order will be discussed in Sect.IU.2.2. It turns out 
that the number of coupling conditions increases very fast with order and the proofs 
for high order are often very cumbersome. There ls, however, a very elegant proof of 
the order for the partitioned method which is the most important one in connection 
with“sgeometric integration , as we Shall See now. 
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II.2.2 Lobatto JHIA-IHUB Pairs 


These methods generalize the Stormer-Verlet method to arbitrary order. Indeed, the 
left method of Table 2.1 is the trapezoidal rule, which is the Lobatto IIA method 
with s 一 2, and the method to the right is equivalent to the midpoint rule and, apart 
from the values of the ci, ls the Lobatto IIB method with s = 2. Sun (1993b) and 
Jay (1990) discovered that for general s the combination of the Lobatto IIA and 
IJIUB methods are Suitable for Hamiltonian Systems. The coeffticients of the methods 
fors 王 3aregivenin Table 2.2. Using the idea of discontinuous collocation，we 
give a direct proof of the order for this pair of methods. 


Table 2.2. Coefficients of the 3-stage Lobatto IIA-IIB pair 








0|0 .00 0 0 |1/6 -1/6 0 
1/2 | 5/24 1/3 -1/24 1l/2|1/6 13 0 
1|316 2/3 1/6 1|16 5/6 0 

| 16 2/3 1/6 | 1/6 2/3 1/6 


Theorem 2.2. 77Pe PaFtifioed RU1ge-KM1ia 1aietpod co1posed or 1pe 5s-S1age ZLo- 
patto LA4 ad tjpe s-s1age Lopalio 111B 1netpod 1 oforder 2s 一 2. 


Poojf Let cl = 0,cz,...,cs_ lcs 三 1andb)...,bsbethenodes and weights of 
the Lobatto quadrature. The partitioned Runge-Kutta method based on the Lobatto 
JIUIA-IIUIB pair can be interpreted as the discontinuous collocation method 


如 ) 三 如 一 Pi(olto) 一 gu(to),u(to))) 
t0 人 cij) = Ju 人 (to 十 cij),uolto 十 cij))， 不 三 半 3 


2.6 
已 加 十 cj 一 g(u(to 二 coto 十 ci 站 )， 1 一 2,...,5 一 1 CO 





where u(t andv(b are polynomials of degree s and s 一 2, respectively. This is seen 
as in the proofs of Theorem 1.4 and Theorem 1.8. The Superconvergence (order 
25 一 2) ls obtained with exactly the Same proof as for Theorem 1.9，where the 
functions ww 人 (and y( 轨 have to be replaced with (wu 人 (bu 坟 ) and (yz( 轨 )， 
etc. Instead of Lemma 1.10 we use the estimates (fort E [to,to 十 刀 ) 


芒 锥 丽 三 和 同 | 芝 二 远 本 
ol 人 一 xz 人 | < cpsrR for 大 一 0... ,5 一 2， 


which can be proved by following the lines of the proofs of Lemma 1.6 and 
Lemma 1.10. 
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II.2.3 Nystrom Methods 


Da bis jetzt die wirekte Anwendung der Rungeschen Methode auf den 
wichtigen Fall von Differentialgleichungen zweiter Ordnung nicht behan- 
delt war . . . (上 上.J. Nystrom 1929) 


Second-order differential equations 


一 9 已 甸 切 (2.7) 


form an Important class of problems. Most of the differential equations in Chap.I 
are of this form (e.g., the Kepler problem, the outer Solar System, problems in mole- 
cular dynamics). This is mainly due to Newton's law that forces are proportional 
to Second derivatives (acceleration). Introducing a new variable zx = V for the first 
derivative, the problem (2.7) becomes equivalent to the partitioned System 


2 三 2 站 岂 2 幼 )， (2.8) 
Apartitioned Runge-Kutta method (2.2) applied to this System yields 


计 王 20 十 户 》 0i7O， 
JI 一 1 和 S 
记 一 g(t+ci 80 十 户 》 aijjj 2+P ab (2.9) 
S 二 : S 9 
%1L 王 g0 十 六 >》， 2 2 王 20 十 户 》 Di 
2 记 三 款 


If we insert the formula for Ai; into the others, we obtain Definition 2.3 with 
0 (2.10) 
K 一 1 大 一 1 


Definition 2.3.Let ci, 0， Cij and 已 ,Qij be real coefficients. A_ Nystromz 7metpod for 
the solution of (2.7) is given by 


S S 
4 刀 二 g 人 (ta 二 ci yo+cipnoo 十 大 5aj6， yo 十 hy 70)， 
和 1 


5 二 (2.11) 
多 一 加 +jpgo+ 居 >》 页 = 加 + 六 >》 bb 
小 二 


For the important Special case 了 = 9( 妨 where the vector field does not de- 
pend on the velocity, the coefficients 0ij need not be specified. A Nystrom method is 
oforder2 证 册 一 yt 十 站 = O(PP+1 and 人 一 (to 二 四) 一 O(hP+T).Itisnot suf- 
ficient to consider yl1 alone. The order conditions will be discussed in 9ect. HUI.2.3. 

Notice that the Stormer-Verlet Scheme ([L.1.17) ls a Nystrom method for prob- 
lems ofthe form 刀 = 9 信人 .Wehaves 王 2,andthe coefficients are cl = 0,c2 = 1 
Q11 三 Q12 三 Q22 一 0， Qo21 一 1/2， 2 一 1/2， Do 一 0， and 7 一 也 一 1/2. With 
dgn+1/2 三 gmn 十 和 0n -112 the step (do _1/2,Vn 1/2) rm (qn+l/2) n+l/2) of G.1.17) 
becomes a one-stage Nystrom method with cl = 1/2, mil = 0， 六 一 太 一 工 . 
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JI.3 The Adjoint of a Method 


We Shall see in Chap.V that sy1z111etric numerical methods have many impor- 
tant properties. The key for understanding Symmetry is the concept of the adJozzt 
method. 

The fow w%: of an autonomous differential equation 


= 帮 y)， gt 如 ) = yo (3.1) 


Satisfies 0 一 pf. This property ls 1of# in general, Shared by the one-step map 
2 of a numerical method. An iustration ls presented in the Upper Picture of 
Fig.3.1 (a)，where we See that the one-step map 2 for the explicit Euler method 
is different from the inverse of B_7， which is the implicit Euler method. 


Definition 3.1. The adjoizt metpod G7 of a method Ghn is the inverse map of the 
original method with reversed time Step 一 六 ,ie.， 


Gr :一 GT (3.2) 


一 岂 


(see Fig.3.1(b)). In other words, yi = 下 (yo) is implicitly defned by G_7(V) 三 
2V0.A method for which 27 三 Zn is called syzz117et7zc. 





Fig. 3.1. Deftinition and properties of the adjoint method 


The consideration of adjoint methods evolved independently from the Study of 
Symmetric integrators (tetter (1973), p. 123, Wanner (1973)) and from the aim of 
constructing and analyzing Stifft integrators from explicit ones (Cash (1973) calls 
them “the backward version ”which were the first example of mono-implicit meth- 
ods and Scherer (1977) calls them "reflected methods”). 

The adjoint method satisfies the usual properties such as (57 六 一 Gh and(Gho 
功放 三 玉 o2 for anytwo one-step methods Gand 吃 . The implicit Euler 
method is the adjoint of the explicit Euler method. The implicit midpoint rule ls 
Symmetric (See the lower picture of Fig.3.1(a)), and the trapezoidal rule and the 
Stormer-VYerlet method are also Symmetric. 

The following theorem Shows that the adjoint method has the same order as the 
original method, and, with a possible sign change, also the Same leading error term. 


Theorem 3.2. ZLet pf pe 1je exact jiow of(3.7) azzd /et @Gh pe ad o1e-step 11etpod of 
Oo1derD Satisjyz118 
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有 (go) = Pay 十 CoM 二 OU) G.3) 
7T71e aqdjoipt 1aetjhod G7 加 ea pas 1je sa11e ofderD and We jave 
呈 (go) = jlo) 十 (DCUo)N 二 OU)， G.4) 
太 ri1je 711etpod 58 Sy121etric 118 (1GXI1GL Order 18 eve1. 


Prooj The idea of the proof is exhibited in drawing (c) of Fig.3.1. From a given 
initial value yo we compute ppn(yo) and Vi 三 下 (yo)，wWhose difference e” is the 
local error of 27 .This error is then“projected back” by 2_hn to become e. We See 
that 一 e ls the local error of B_h, ie., by hypothesis (3.3)， 


e= (DPCUn(yo)N2 十 OU 人) G.5) 
Since opn(yo) = 十 O(AJ ande=(T 十 O(A)e*itfollows that 
e 一 (-DzCUOoMP 人 十 OP ) 


which proves (3.4). The statement for Symmetric methods is an Immediate conse- 
quence of this result, because 51r = 57 implies C(Vo) = (一 DZC(yo),andtherefore 
C(yo) can be different from zero only for even 7. 














JI.4 Composition Methods 


The idea of composing methods has some tradition in Several variants: composition 
of different Runge-Kutta methods with the Same Step Size leading to the Butcher 
group, which is treated in Sect. JI.1.3; cyclic composition of multistep methods for 
breaking the“Dahlquist barrier” (See Stetter (1973), p.216); composition of low 
order Runge-Kutta methods for increasing Stability for stifft problems (Gentzsch 公 
Schluter (1978), Iserles (1984)). In the following, we consider the composition of a 
given basic one-step method (and, eventually, its adjoint method) with dzjerent Step 
SizeS. The aim is to increase the order while preserving Some desirable properties 
of the basic method. This idea has mainly been developed in the papers of Suzuki 
(1990), Yoshida (1990), and McLachlan (1995). 

Let gj be abasic method and 71,... ,7s real numbers. Then we call its compo- 
Sition With Step Sizes 说 忆 ,72 帮 .Ts 用 ie.， 


且 王 Bo.oGDp (4.1) 
the corresponding co7Dposi1io7 11ethod (See Fig.4.1(a)). 
Theorem 4.1， ZLel Dj pe ad o1e-step 11etjpod oforderD. 不 


立 寺 + = 1 


0 4 


1Pe1 1je co111DoSi1io1l 1iethod (4.1)1 18 at least oforderD 十 工 . 
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人 OPmiP(2o) 





0 





SG 六 21 So 
Fig. 4.1. Composition of method @n with three Step Sizes 


Prooj The proof is presented in Fig.4.1(b)for s 三 3.Jtis very Similar to the proof 
of Theorem 3.2. By hypothesis 











el = CUo) .PT TO(PT 
ea 一 C() .2HTTMPTTLT 十 OAP+2) (4.3) 
ea = CU) .十 OU 





We have, as before, yi =V0 十 O(P) and 克 =(T+O())eiforall;and obtain, for 
2 三 | 
pojz(0) 一 玖 (0) = 丙 十 本 二 本 =CUOoO 十 认 十 语 )7M2 人 十 OU ) 


which shows that under conditions (4.2) the O(Ah2+1)-term vanishes. 














上 xample 4.2 (The Triple Jump). Equations (4.2) have no real solution for odd 7. 
Therefore, the order increase ls only possible for even D. In this case, the Smallest 
s Which allows a solution ls s 三 3. We then have Some freedom for Solving the 
two equations. I we impose Symmetry ?1 三 73, then we obtain (Creutz 儿 Gocksch 
1989, Forest 1989, Suzuki 1990, Yoshida 1990) 


1 21/(pP+1) 


7 二 3 二 7 5270C+HT， 2 二 7 570 (4.4) 


This procedure can be repeated: we start with a Symmetric method of order 2, apply 
(4.4) with 2p = 2 to obtain order 3; due to the Symmetry of the 7 this new method 
is in fact of order 4 (See Theorem 3.2). With this new method we repeat (4.4) with 
2D =4andobtain asymmetric 9-stage composition method of order 6, then with 
2D 三 6a27-stage Symmetric composition method of order 8, and So on. One obtains 
in this way a1zy order, howeverm at the price of a terrible zig-zag of the step points 
(See Fig.4.2). 


宝 
| 
oo 








1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
| 
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Fig. 4.2. The Triple Jump of order 4 and its iterates of orders 6 and 8 
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了 xample 4.3 (Suzuki's Fractals)， If one desires methods with smaller values of 
Yi， one has to increase 5s even more. For example, for s = 5 the best solution of 
(4.2) has the Sign Structure 十 十 一 十 十 With 7y1 = 72 (See Exercise 7). This leads to 
(Suzuki 1990) 


1 41/(p+D) 
4 





(4.5) 





The repetition of this algorithm for p = 2,4,6,...leadsto afractal Structure of the 
Step points (See Fig.4.3). 











1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
| 
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Fig. 4.3. Suzuki"s“fractal "composition methods 


Composition with the Adjoint Method. If we replace the composition (4.1) by the 
more general formula 


夺 =auspogGBbho.o55joGBaihoGpi (4.0) 


the condition for order p 十 1 becomes, by using the result (3.4) and a Similar proof 
as above， 
0 十 ad 十 万 十 .十 0 十 as 


(4.7) 
人 





This allows an order increase for odd p as well. In particular，we See at once the 
solution al = 0 = 1/2 forp = s = 1 ,which turns every consistent one-step 
method of order 1 into a second-order Symmetric method 


三 到 jao 喧 / (4.8) 


了 上 xample 4.4. I 2h is the explicit (resp. implicib Euler method, then 几 , in (4.8) 
becomes the implicit midpoint (resp. trapezolidal) rule. 


了 Example 4.S. In a second-order problem d = D, 六 = 9g(q), 让 is the sym- 
plectic Euler method，which discretizes 9 by the implicit Euler and P by the ex- 
plicit Euler method, then the composed method 人 in (4.8) is the Stormer-Verlet 
method (I.1.17)， 
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A Numerical Example， To demonstrate the numerical performance of the above 
methods， we choose the Kepler problem (I.2.2) with e = 0.6 and the initial values 
from (IL.2.11). As integration interval we choose [0,7.5], a bit more than one revo- 
lution. The exact solution is obtained by carefully evaluating the integral (1.2.10)， 
Which gives 

%= 王 8.67002632314281495159108828552， (4.9) 


with the help of which we compute ", 2,7 from (1L.2.8) and (1.2.6). This gives 


d1 三 一 0.828164402690770818204757585370 
d2 一 ”0.778898095658635447081654480796 
D1 = 一 0.856384715343395351524486215030 
Da2 三 一 0.160552150799838435254419104102 . 


(4.10) 


As the basic method we use the Yerlet Scheme and compare in Fig.4.4 the perfor- 
mances of the composition Sequences of the Triple Jump (4.4) and those of Suzuki 
(4.5) for a large number of different equidistant basic Step Sizes and for orders 
2 三 4,6,8,10,12. Each basic step ls then divided into 3,9,27,81,243 respectively 
5,25,125,625, 3125 composition Steps and the maximal fnal error ls compared 
with the total number of function evaluations in double logarithmic scales. For each 
method and order all the points lie asymptotically on a Straight line with Slope 一 2. 
Therefore, theoretically, a higher order method will become Superior when the pre- 
cision requirements become Sufficiently high. But we see that for orders 10 and 12 
these“break even points” are far beyond any precision of Practical interest，after 
Some 40 or 50 digits. We also observe that the wild zig-zag of the Triple Jump (4.4) 
is a more Serious handicap than the enormous number of Small steps of the Suzuki 
Sequence (4.3). 

For later reference we have also included, in black Symbols, the results obtained 
by thetwo methods (V.3.11) and (V.3.13) of orders 6 and 8, respectively, which will 
be the outcome of a more elaborate order theory of Chap. IIL. 
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Fig. 4.4. Numerical results of the Triple Jump and Suzuki step sequences (grey Symbols) 
compared to optimal methods (black Symbols) 
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JI.s Spblitting Methods 


The splitting idea yields an approach that is completely different from Runge-Kutta 
methods. One decomposes the vector field into integrable pieces and treats them 
Separately. 








FAALAAAAY 本 中 人 人 人 人 全 人 
0 人 | 
ZU 7ZUZ 一 位 人 7 加 二 2 二 十 上 和 4 四 
AAAAAA 0 444T 人 1 
AAAATAAAAAA ee 4 汪汪 不 不 全 个 人 
AAAAATATAAAATAAA -Te 本 本寺 吉本 








Fig. S$.1. A splitting of a vector field 


We consider an arbitrary System 7 = UV) in 到", and suppose that the vector 
field is “split ”as (See Fig.3.1) 


= Jr+7D(). (5.1) 


开 then, by chance, the exact fows o and 2 加 of the systems 1 = JJ (y) and 
乡下] (y) can be calculated explicitly, we can, from a given initial value yo, first 
Solve the first System to obtain a value V1/2, and from this value integrate the second 
System to obtain 1. In this way we have introduced the numerical methods 


D] 
信 V1/2 ”YA 
2 1 G* 人 
0 4 2 2 分 
本 一 pl oo 2/ ID 6 
0 
0 V1/2 风 

Where one is the adjoint of the other. These formulas are often called the Lie 一 


Tottersplitting (Trotter 1959). By Taylor expansion we find that ( o 昌 O o 问 )(yo) 三 


on(yo) 十 O(1p2), sothatboth methods give approximations of order 1 to the solution 
of (3.1). Another idea is to use a Symmetric version and put 


2j 
/2 六 
Day 
S 1 2 1 
二 久 如 本 G.3) 
玉 
yo 
的 六 /2 
which is known as the Strang SPlittingl (Strang 1968)，and sometimes as the 
Marcjak spliting (Marchuk 1968). By breaking up ip (5.3) p 中 一 2。 o ep 岂 ，， 


1 The article Strang (1968) deals with spatial discretizations of partial differential equations 


Such as it 三 4uz 十 再 There, the functions 上 四 typically contain differences in only 
one Spatial direction. 
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WwWe See that the Strang Splitting 6 三 Zn/2o907 12 is the composition of the Lie- 


Trotter method and its adjoint with halved step Sizes. The Strang Splitting formula 
is therefore Symmetric and of order 2 (See (4.8)). 


了 上 xample S.1 (The Symplectic 上 Euler and the Stormer-Verlet Schemes). Sup- 
pose we have a Hamiltonian system with separable Hamiltonian 万 (p, 9) = 人 (D) 十 
U(d). We consider this as the sum of no Hamiltonians, the first one depending only 
on Dp, the Second one only on 9q. The corresponding Hamiltonian Systems 


) -0 ) 一 Try 
轴 and 5 (5.4) 
4 三 了 p(D) 4=0 
can be Solved without problem to yield 
t) 一 划 二 一 志 CA 
pb = po 2 的 一 po 一 tCe(go) 5 
4 的 = ao 十 t77(po) 4 的 = 90. 


Denoting the fows of these two systems by po7 and 7 , we see that the symplectic 
Euler method (I.1.9) is just the composition 0 O O8 Furthermore, the adjoint of 
the Symplectic Euler method is 0 O 0 ， and by Example 4.5 the Verlet Scheme is 
0 /29 Oo /2 the Strang splitting (5.3). Anticipating the results of Chap. VL the 
flows of and p8 are both symplectic transformations, and, since the composition of 
Symplectic maps is again Symplectic, this gives an elegant proof of the Symplecticity 
of the “symplectic”Euler method and the Verlet Scheme. 


General Spblitting Procedure. In a similar way to the general idea of composi- 
tion methods (4.6)，we can form with arbitrary coefficients al, p1,Q2)...，Qmy bm 
(where, eventually, al Or pm, Or both, are zero) 

一 p 思 ， oO 2， Oo 2 站 ep oO 2 oO 20 (5.0) 
and try to increase the order of the scheme by suitably determining the free coef6- 
clients. An early contribution to this Subject is the article of Ruth (1983), where, for 
the Special case ($.4), a method (9.60) of order 3 with mm = 3 1 constructed. Forest 
贸 Ruth (1990) and Candy 儿 Rozmus (1991) extend Ruth's technique and construct 
methods of order 4. One of their methods is just (4.1) with 说 ,72,73 given by (4.4) 
(2 = 2) and on from (9$.3). A Systematic Study of such methods started with the 
articles of Suzukli (1990, 1992) and Yoshida (1990). 

A close connection between the theories of Splitting methods (3$.6) and of com- 
position methods (4.6) was discovered by McLachlan (1995). Indeed, 让 we put 
0 = al and break up ep 一 2 加 6 2 ) (group property of the exact fow) 


where al ls given in (3.8), further 有 关 1 O 的 and So on (cf. Fig.3.2),， we 


See, Using (9.2), that of ($.0) ls identical with 办, of (4.60), where 


本 = ogj sothat 有 号 9 op @ 


A necessary and sufficient condition for the existence of ai and ii satisfying (3.8) 
is that > ai = 》 mi, which is the consistency condition anyway for method (3.0). 
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a= 太 

bo 三 [LO 十 ad (5.8) 
a 一 al 十 已 

b2 一 02 十 a2 

aa 三 Qa2 十 [03 

03 = [03 





Fig. $.2. Equivalence of splitting and composition methods 


Combining Exact and Numerical Flows. It may happen that the differential equa- 
tion wy = jy) can be split according to ($.1)，such that only the fow of，say， 
= JI(y) can be computed exactly. If FDI(y) constitutes the dominant part of 
the vector field, it is natural to Search for integrators that exploit this information. 
The above interpretation of Splitting methods as composition methods allows us to 
construct Such integrators. We just consider 


2) 一 o 昌 O G 站 ， 9G7 一 6 O o 电 (3.9) 


as the basis of the composition method (4.0). Here o is the exact fow of y 三 


FED(), and 6 加 is some first-order integrator applied to 六 = JP(y). Since Ghn of 
(9.9) ls consistent with (5.1), the resulting method (4.6) has the desired high order. 
It is given by 


信友 Vein 人 5 S 5 允 则 加 0 有 人 9 0 (0G.10) 


Notice that replacing 2 with a low-order approximation 5 in (3.0) would not 


retain the high order of the composition，because 6 does not Satisfy the group 


property. 
Splitting into More than Two Vector Fields. Consider a differential equation 


少 = JJ)+J 加 (人 +.… 二 FI)， (G5.11) 
Where we assume that the fows oO ] of the individual problems yy = 太 (y) can 
be computed exactly. In this case there are many possibijlities for extending (3.0) 
and for writing the method as a composition of 站 间 的 忆 大 .... This makes 
it diftcult to find optimal compositions of high order A Simple and efficient way ls 


to conslider the first-order method 
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2) 一 o 昌 oo 间 oM] 


together with its adjoint as the basis of the composition (4.6). Without any additional 
effort this yields splitting methods for (5.11) of arbitrary high order. 


II.6 玫 Xercises 


1. 


纺 : 


Compnute all collocation methods with s 王 2 as afunction of cl and cz?. Which 
of them are of order 3, whlich of order 42 

Prove that the collocation Solution plotted in the right picture of Fig. 1.3 is com- 
posed of arcs of parabolas. 


.Letb =M = 1/8cs = 1/3, cas = 2/3, and consider the corresponding 


discontinuous collocation method. Determine its order and find the coefficients 
of the equivalent Runge-Kutta method. 


. Show that each of the Symplectic Euler methods in (1.1.9) ls the adjoint of the 


other. 


. (Additive Runge-Kutta methods). Let bi aij and bi,0ij be the coefficients of 


two Runge-Kutta methods. An additive Runge-Kutta method for the solution 
ofz = JW)+JEIG)isgiven py 


1 一 训 (mw 十 刀 》， oj 让 】 十 JD (m 十 六 > 后 商 ) 
了 = 外 站 二 
本 


Show that this can be interpreted as a partitioned Runge-Kutta method (2.2) 
applied to 


小 人 轨 + 站 人， 2 站 全 十 广 人 
with y(0) = z(0) = yo. Notice that y 食 三 2z( 如 . 


. Let Oh denote the Stormer-Verlet Scheme, and consider the composition 


厅 丰 国语 2 全 DG 态 


了 2K 十 1 





with 7 二 四 人 KE 十 2 Te2k+1. Compute 71 and +1 Such 
that the composition gives a method of order 4. For Several differential equa- 
tions (pendulum, Kepler problem) study the global error of a constant Step Size 
implementation as a function of 开 . 





. Consider the composition method (4.1) with s = 5, 35 三 ?1 and74 三 72. 


Amonsg the solutions of 
271 十 272 十 73 三 了 2 放 十 2)2 十 从 一 0 


find the one that minimizes |27 了 十 273 十 ?3|， 
RemaK. This property motivates the choice of the 7i in (4.5). 


Chapter II. 
Order Conditions, Trees and B-Series 


In this chapter we present a compact theory of the order conditions of the meth- 
ods presented in Chap.I, in particular Runge-Kutta methods, partitioned Runge 一 
Kutta methods, and composition methods by using the notion of rooted trees and 
B-series. These ideas lead to algebpraic Structures which have recently found inter- 
esting applications in quantum field theory. The chapter terminates with the Baker- 
Campbell-Hausdorff formula，which allows another access to the order properties 
of composition and splitting methods. 

Some parts of this chapter are rather Short,but nevertheless Self-contained. For 
more detailed presentations we referto the monographs ofButcher (1987), of Hairer， 
Ngwrsett 久 Wanner (1993), and of Hairer 儿 Wanner (1990). Readers mainly inter- 
ested in geometric properties of numerical integrators may continue with Chap- 
ters IJV, V Or VI before returning to the technically more diftcult jungle of trees. 


JIU.1 Runge-Kkutta Order Conditions and B-Series 


Even the standard notation has been found to be too heavy in dealing with 
fourth and higher order processes,. . . (R.H. Merson 1937) 


In this section we derive the order conditions of Runge-Kutta methods by com- 
paring the Taylor series of the exact Solution of (1.1) with that of the numerical 
solution. The computation ls much Simpliftied, frst by considering an aio7zo711O1S 
System of equations (Gill 1951), and second, by the use of rooted trees (connected 
graphs without cycles and a distingulished vertex; Merson 1957). The theory has 
been developed by Butcher in the years 1903-72 (See Butcher (1987), 9ect. 30) and 
by Hairer & Wanner in 1973-74 (See Hairer Ngrsett 儿 Wanner (1993), Sections JI.2 
and I.12). Here we give new Simplified prootfs. 


JIU.1.1 Derivation of the Order Conditions 
We conslider an autonomous problem 


= Jo)， yto)=2%o， (1.1) 


Where 厂 : 棚 "” 一 陈 "” is sufficiently differentiable. A problem 7y = j 纪 轨 ) can be 
brought into this form by appending the equation 寺 王 1. We develop the subsequent 
theory in four steps. 
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Er Sagte es klar und angenehm， 
Was erstens, Zweitens und drittens kam ”. (W. Busch, Jopsiade 1872) 


First Step. We compnute the higher derivatives of the solution y at the initial point 
to. For this, we have from (1.1) 


y9 = (0) 9 (1L2) 


and compnute the latter derivatives by using the chain rule, the product rule, the 
Symmetry of partial derivatives, and the notation 户 (y) for the derivative as a linear 
map (the Jacobian), 大 (zy) the second derivative as a bilinear map and similarly for 
higher derivatives. This gives 


y = jg) 
六 = 万 0)9 人 
8 = 族人 轨 (信念 十 产 O) 
9 = (的 ( 信 冰 及 十 3 六 的 俯 芒 二 六)y 
8 一 了 (人 扫 ( 人 六 六 轨 二 67 (人 六 六 二 4 人 
上 3 族人 信 太 十 瑚 ( 轨 W， 


and So on. The coeffticients 3, 6,4,3,... appearing in these expresSsions have a cer- 
tain combinatorial meaning (number of partitions of a set of g 一 1 elements), but for 
the moment we need not know their values. 


Second Step. We insert in (1.3) recursively the computed derivatives 7, .into 
the right Side of the Subsequent formulas. This gives for the first few 


六 三: 
1 = 户 F (1.4) 

人 

人 


Where the arguments (V) have been Suppressed. The expressions which appear in 
these formulas, denoted by 已 (7)，will be called the elementary di1erentials. We 
ITepresent each of them by a Suitable graph 7 (a Tooted tree) as follows: 

Each 三 becomes a vertex, a first derivative 有 疡 becomes a 
vertex with one branch,， and a kth derivative FU becomes a 了 
vertex with 8 branches pointing upwards, The arguments of the 
K-linear mapping 上 (oO(y) CoOrreSspond to trees that are attached 了 了 
on the Upper ends of these branches. The tree to the Tight cor- 
Tesponds to 态 ( 户 户 旋 . Other trees are plotted in Table 1.1. mm 
the above process, each insertion of an already known derivative 
consists of grafting the corresponding trees upon a new IToot as 
in Deftinition 1.1 below, and inserting the corresponding elementary differentials as 
arguments of /mx)(y) as in Definition 1.2. 
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Table 1.1. Trees, elementary differentials, and coefficients 





7| 本 graph | a(7) | Fr) | 7Yr) g(7) CT) 
1 上 1 5 1 
: 1 矿 / 2 开 ，biai 1 
V 1 矿 ( 记 万 3 2 六 天 Qi7Q 庆 2 
3 []] ) 1 太 广 7 6 > RiQi7Q 和 中 
4 1 六 4 | mbasaikail | 6 
4 | [| 小 3 | FJp 方 | 8 | 二 waiaskai | 1 
4 | [[,。"]] ， 1 | 7 六 了 2 | waijakail | 2 


1 户 广 7 24 2 有 DiQi7Q7JEQKL 工 








1 。 





2 上 





尼 [。,。] 



































Definition 1.1 (Trees). The set of (rooted) eeysT is recursively defined as follows: 
a) the graph 。Wwith only one vertex (called the root) belongs to 了 ; 
b) 于 亡 )….,Tr E ,then the graph obtained 


by grafting the roots of 刻 ，...，,Tr to a new 
Vertex also belongs to 了. Itis denoted by 


示 二 | 交 人 人 光合 
TOOt ”一 一 
and the new vertex ls the Toot of 7. 


We further denote by |7| the order of 7 (the number of vertices), and by a(7) the 
coefficients appearing in the formulas (1.4). We remark that Some of the trees among 
刀 ,...,Tzp Iay be equal and that 7 does not depend on the ordering of 刻 ，. .. ,Tim: 
For example, we do not distinguish between [[。],。] and |。,| .|]]. 


Definition 1.2 (了 lementary Differentials). For atree7 E 了 the elemremia1y dzre 六 
e11ial is a mapping 已 (7) : 了 ”一 及 ", defined recursively by 下 (。)(V) = 太 g 
and 


Fr 的 = 站 mO( 人 ED 的 Fn) 人) 各 了 = 四 mo 


Examples of these constructions and the corresponding coeffticients are Seen im 
Table 1.1. With these definitions, we obtain from (1.4): 


Theorem 1.3. 77Pe dg 太 derivative of tpe exact Solutio1za is give1 Dy 


yt9(to) = >》 ar) Fr)(yo)， (1.5) 


|r|l=4 











Where a(T) are Positive integer coe 太 cie1a15. 
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Third Step. We now turn to the numerical solution of the Runge-Kutta method 
(II.1.4), which, by putting 几 i; = 9ij， we Write as 


and 


由 二 如 十 》 ai 9 儿 = 加 十 》 i9i (7) 
了 


where Wi，0i and Wi are functions of 几 . We develop the derivatives of (1.6)，by 
Leibniz” rule, and obtain gt9) = ja))G + (fa))G 0 .This gives, for 
到 王 0， 

站 (1.8) 


the Same expression as in (1.2)，with y just replaced by w; and with an extra factor 
d. Consequently, exactly as in (1.3)， 


和 i 三 工 .Jo) 
页 = 2. 帮 (oo) 候 (1.9) 
gg = 3 (Poj(so) 十 天 (oo) 这 ) 
go 人 aa) 十 3 人 ai) 二 Fo) 
9 = 5 (Jo 证 ) 十 6 让) 十 4P(Co)(u mo) 
+37(gol(i) 十 Po)u)， 


and So on. Here, the derivatives of 0i and wi are evaluated at 妃 = 0. 


Fourth Step. We now insert recursively the derivatives 如) into (1.9). This 
will give the next higher derivative of 9gi, and, using 
人 (10) 
了 


which follows from (1.7), also the next higher derivative of w. This process begins 
aS 


六 =(1.2) ()aiz) 太 7 让 一 (2) (jxazajp) 肠 让 (1.11) 
and So on. 玉 we compare these formulas with the first lines of (1.4), we see that the 
results are precisely the Same, apart from the extra factors, We denote the ;integer 
Jaciors 1, 1.2,...by 7Y(7) and the factors containing the aij"s by gi(7) and ui(7)， 


Tespectively. We obtain by induction that the Same happens in general, ji.e. that, in 
contrast to (1.5)， 
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g9| = 》、 37(r).gi(r).a(r) Fr)(oo) 


|r|=4 


ul = 》 Yuilr) :alr) Fr)(o)， 


|r|=4 


(1.12) 


Where a(T) and 严 (7) are ie saie quantities as before. This is seen by continuing 
the insertion process of the derivatives ul9) into the right-hand Side of (1.9). For 
example, if and ii are inserted into 3.( 记 ,zi), we will obtain the corresponding 
exXpression as in (1.4), multiplied by the two extra factors ui( / ), brought in by 了 i， 
and ui(。) from 邮 ii. For a general tree7 了 三 [ 记 , .Tom] this will be 


Second, the factors 7(/ ) and 7Y(。) will receive the additional factor 9 = |7| from 
(1.9), ie., we will have in general 


y(T) = |r| 7(r) TY(Tm). (1.14) 


Then, by (1.10)， 


This formula can be re-used repeatedly, as long as Some of the trees 九 ,. ..,Tmr are 
of order > 1. Finally, we have from the last formula of (1.7), that the coefficients for 
the numerical Solution, which we denote by (7) and call the elemmemtary Weigjzs， 
Satisfy 


g(T) 一 》 Digi(7)， (G.10) 
We Summarize the result as follows: 


Theorem 1.4. 77Pe derivatives or 1pe 11ierical SOLUion of a Runge-KUita 11etpod 
(11.7. 孙 Jor 玉 三 0 aye give py 


gg = 》、 37(r) gr) a(r)F(r)(yo)， (17) 


|r|=4 


We1e a(T) and 下 (7T) Gd1e te sa1ae dg 太 Tpeore1a 71.3 1pe coefjicient TY(T) Sa1S 记 
T(。) 三 1a1d (7.74). Te elemmenfary Weigjts g(T) Ga1e opiaiped jjomna 加 e ieeT Ga5 
jllows: attacpn to eyemy verter ad Sa1ta1io1 1etterf( ”1o 1pe1oohl ipeng(T) it1he 
SU112，OVET GLL SU11IG1IO1 DCices OFa Pi1oduct co11Dposed of Di ad Jociors ai Jor 
eacH yertexz “7 ”directny co111iected Wi 太 “ 天 ” Dy Ga1 UDPWaras directed D1a1zc1. 


Prooj Repeated application of (1.15) followed by (1.16) shows that the elementary 
weight (7) is the collection of > ; ww from (1.16) and all > / aij of (1.15). 
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Theorem 1.S. 77Pe Rz128ge-KU1lta 11etjpod jas ofderDp 矿 azad ompy 矿 
bg(7) = 一 一 Jor |rl| < 2. (1.18) 


Prooj The comparison of Theorem 1.3 with Theorem 1.4 proves the sufficiency 
of condition (1.18). The necessity of (1.18) follows from the independence of the 
elementary differentials (See e.g.,， Hairer, Ngrsett 久 Wanner (1993),， Exercise 4 of 
Sect. II.2). 














了 上 xample 1.6，For the following tree of order 9 we have 





工 
> oaijQjmQinQiEQKIQIQIrQHp 一 一 
三 9 也 9 7 5 3 
人 72772357D5Q 
or by using 沁 0 一 
并 ， ?一 元 
iiQ27TCTQIKECKEQKEICI 一 
270 


[人 


The quantities %(7) and 7y(7) for all trees up to order 4 are given in Table 1.1. This 
also verifies the formulas (II.1.0) stated previously. 


II.1.2 B-Series 


We now introduce the concept of B-series, which gives further insight into the be- 
haviour of numerical methods and allows extensions to more general classes of 
methods. 

Motivated by formulas (1.12) and (1.17) above, we consider the corresponding 
seriey as the objects of our study. This means, we study power series in hlr| contain- 
ing elementary differentials 天 (7) and arbitrary coefficients which are now written in 
the form aw(7). Such series will be called B-series. To move from (1.6) to (1.13) we 
need to prove a result stating that a B-series isyerted inio 天 帮 .) 1 a8gai ad BB-series. 
We Start with 


B(o 切 =yTa(*)pjg 二 Ta) 及 扫 二 9 二 9 (1.19) 
and get by Taylor expansion 
jj(B(@ = 局 人 + 司 一 朵 的 二 NO 区 全 二 0120 


Inserting 0 from (1.19) and multiplying out, we obtain the expression 


3 
pf(B(@wg) = if 上 +P2a( JPp+Paa(7)PPF+ 本 人 
区 二 5 
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This beautiful formula is not yetperfect for two reasons. First, there ls a denominator 
2! in the fourth term. The origin of this lies in the yazzaetry of the tree \. We 
thus introduce the symmetry coefficients of Definition 1.7 (following Butcher 1987， 
Theorem 144A). Second, there is no first term yy. We therefore allow the factor a( 幼 
in Definition 1.8. 


Definition 1.7 (Symmetry coefficients). The symmetry coefficients ac(7) are de- 
fined by cl(。) 王 1 and,for7=[ 记 ,7Tm]， 


0lT) 三 Ga(7) pa(Tm) Ho ， (1.22) 
Where the integers /11, /12,.. .count equal trees among 刀 ，...,Tm 


Definition 1.8 (B-Series). For a mapping wa :TU{ 信 一 及 aformal series ofthe 
form 


|7| 
Bug 人 =ag)y+ 习 5 呆 乓 的 (1.23) 
本 E 人 


is called a B-series.1 

The main results of the theory of B-series have their origin in the paper of 
Butcher (1972), although series expanslions were not used there. B-Series were then 
introduced by Hairer & Wanner (1974). The normalization used in Deftinition 1.8 
is due to Butcher 儿 Sanz-9erna (1990). The following fundamental letmma gives a 
Second way of fnding the order conditions. 


Lemma 1.9. Zet wa :TU{ 人 一 月 pea72apping satisjing a(0) = 1. 71en tjpe 
coresDpo11di118 -Series inserted zi 加 几 F) 1 again aB-seriey. 7T1at 1 


hf(B(o 切 ) = Bo 人， (1L24) 
Where w( 册 =0 ao ) 王 上 amd 


a(7T) = a(m) amm) 国庆 人 三 | 疝 和 | (1.25) 


Pioof Since a( 人 ) = 1 we have B(ay) = 十 O(P,sothat nj(B(a) can be 
expanded into a Taylor series around y. As in formulas (1.20) and (1.21), we get 


1 In this section we are not concerned about the convergence of the series. We shall see 
later in Chap.IX that the series converges for sufficiently small 几 , 这 wa(r) satisfies an 
inequality |a(r)| < 7(r)cdlrl and 这 Fo is an analytic function. If Fw) is only K-times 
differentiable, then all formulas of this section remain valid for the truncated B-series 
2rerlrl<k /with asuitable remainder term of Size O(Pe+1) added. 
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AAA(B(wJ)) = 大 >》， oO )(B(c 人 -中 


0 二 0 


5 击 开导 


70 过 0 T1GE 人 人 玖 站 ET 





ra 
0 a( 刻 ) al(Tm) 


|7| 
3 生 全 omDPCOW 


70 过 0 TIET Tm ET2 


中 | 





with7 三 [ .Tom] 


|7| 
= 习 二 olr)Plrjg) = Blog 


厅 生 下 (7) 


The last equality follows from the fact that there are CR 可 ) possibijlities for writ- 
ing the tree 7T in the form 7 = |,…… ,Tom|. For example， the trees [。,。,|。]]， 
[| ,[。],。| and | ],。,。] appear as 是 terms in the upper Sum, but only as 
one term in the lower Sum. 














Back to the Order Conditions， We present now a new derivation of the order 
conditions that is Solely based on B-series and on Lemma 1.9. Let a Runge-Kutta 
method, Say formulas (1.6) and (1.7), be given. All quantities in the defining formu- 
las are Set up as B-series, 9i = Bl(giyo) ua = Bluiyo), 由 = BOo). Then， 
either the linearity and/or Lemma 1.9, translate the formulas of the method into cor- 
Tesponding formulas for the coefficients (1.13), (1.19), and (1.16). This recursively 
justiftes the ansatz as B-Series. 

Assuming the exactsolution to beaB-series B(e,yoj, aterm-by-term derivation 
of this series and an application of Lemma 1.9 to (1.1) yields 


亲 e( 厂 ) .el(Tm)， 


e(7) = 
Together with definition (1.14) of 7yY(7) we thus obtain 
e( 门 = 一， (1.20) 


A comparison of the coefficients of the B-series Wl = 已 (P,yo) with those of the 
exact Solution gives (1.18) and proves Theorem 1.3 again . 

Comparing the B-series 妃 (e,yo) for the exact solution with Theorem 1.3，we 
get as abyproduct the formula 

四 =- (1.27) 
QT) 一 . 
0(T) TY(T) 

Hthe available tools are enriched by the more general composition law of Theo- 
rem 1.10 below, this procedure can be applied to yet larger classes of methods. 
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JIU.1.3 Composition of Methods 


The order theory for the composition of methods goes back to 1969, when Butcher 
used it to circumvent the order barrier for explicit Sth order 3 stage methods. It led to 
the Seminal publication of Butcher (1972),，where the general composition formula 
in (1.34) was expressed Tecursively. 


Composition of Runge-Kutta Methods. Suppose that，starting from an initial 
value yo，we compnute a numerical solution yl1 using a Runge-Kutta method with 
coefficients aij, w and step Size 几 . Then, continuing from 1，we compnute a value 
ya using another method with coefficients az 0 and the same Step Size. This com- 
position of two methods is now considered as a single method (with_ coefficients 
人 Di) The problem is to derive the order properties of this new method, in par- 
ticular to express the elementary weights dg(7) in terms of those of the original two 
methods. 

Hthe value Wi from the first method is inserted into the Starting value for the 
Second method, one Sees that the coefficients of the combined method are given by 
(here written for two-stage methods) 








Q11  Q12 Q11  Q12 

Q21 0Q22 CQ21  Q22 

人 < 人 

al 032 033 034 一 站 bo al ai2 (1.28) 

Q41 Q42 0Q43 0Q44 pp2 a21 a22 
和 二 
并 | 和 


and ouUI problem is to compute the elementary weights of this scheme. 


Derivation. The idea is to write the Sum for 由 7)， Say for the tree 必 , in full detail 


二 二 
jy) = 和 和 0 关 冯 = (1.29) 


and to split each sum into the two different index sets. This leads to 2I7| dif- 
ferent expressions 二 ; 1 半 ; 1 二 1 学 让 大 十 二 : 3 1 5 1 > 了 下 
海 本 | 志 ; 吕 六 | 六 |./. 十 .We symbolize each expression by drawing the 
Corresponding vertex of7 as a pullet for the first index set and as a sar for the Sec- 
ond. However, due to the zero pattern in the matrix in (1.28) (the upper Tight corner 
is missing), each term with “star above bullet "can be omitted, Since the correspond- 
ing 0ij"s are Zero. So the only combinations to be considered are those of Fig. 1.1. 
We finally insert the quantities from the right tableau in (1.28)， 





























和 VD 三 > ” 访 Qi Qi QKL 十 为 0 DR QK7 十 》 辽 Qi DK QKL 十 >》 上 0 a 也 以 
十 六 及 0 和 0 也 帮 十 六 外遇 0 了 0 和 十 2 上 
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1 
昌 : sr :小 
水 


Fig. 1.1. Combinations with nonzero SR 





and we observe that each factor of the type Dj interrupts the Summation, so that the 
terms decompose into factors of elementary weights of the individual methods as 
follows: 


放 史 ) = 下 当 ) 寺 办 人) 风 -J8LO) 二 内 (7 我 六 二 办 (7) 由. 
二 办 (V) 风 ) 十 穴 (用 和) 二 全) ， 


The trees composed of the “star "nodes of7r in Fig.1.1 constitute all possible“sub- 
treesS”0 (from the empty tree to 7 itself) having the Same root as 7T. This is the key 
for understanding the general result. 


Ordered Trees，In order to formalize the procedure of Fig. 1.1, we introduce the 
Set O7 of ordered 1rees Tecursively as follows: 。E O7, and 


这 wwl, ,wm E O7,then also the ordered mm-tuple (wwm) EO7 (1.30) 


As the name suggests, in the graphical representation of an ordered tree the order of 
the branches leaving cannot be permuted. Neglecting the ordering, atreeT ET can 
be considered as an equivalence class of ordered trees, denoted 7 = 届 . 


For example, the tree of Fig.1.1 has two orderings, namely 必 and (4 We 
denote by (7) the number of possible orderings of the tree 7. Itis given by z(。) 一 


1 and 
7 


Z(T) 三 一 一 一 一 Z(7) (Tom) (1.31) 

HLUN2l 人 
for 7 = [六 ,Tom where the integers 1,12, .are the numbers of equal trees 
among 九 ,...,Tm ,This number is closely related to the symmetry coeffcient c(7)， 


because the product (7) = a(7)z(7) satisfies the recurrence relation 
A(T) 三 IlAK(T) .HA(Tm). (1.32) 
We introduce the set OS7(w) of ordered suptrees of an ordered tree w E O7 by 
O4S7(。) = 夫 .} (1.33) 
OST(w) = { 从 U{10 0 0EOS7T(oi)} for w = (wo wm). 


Each ordered subtree 0 < OST7(w) is naturally associated with atreeO < 了 obtained 
by neglecting the ordering and the 0-components of 0. For every tree7T E 了 we 
choose, once and for all, an ordering. We denote this ordered tree by w(7T), and we 
put OS7(7) = OS7(w(7)). 
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For the tree of Fig. 1.1, considered as an ordered tree, the ordered Subtrees cor- 
Tespond to the trees composed of the“star nodes. 


5 The General Rule. The general composition rule now be- 

? comes visible: forO e 0OS7(w) we denote bywN\g the “for- 

est” collecting the trees left over when 0 has been removed 
from the ordered tree w. For brevity we setT\O := w(T)0. 
With the conventions 入 (0) = 驻 (0) and 他 (从 = 1 we 
then have 


5m= > (ww TIIw) 439 


0OcOST(7) OocETNO 





This composition formula for the trees up to order 3 reads: 
4 一 从 (有 .的 十 办 (。) 
87) 二 从 ( 肋 : 风 ] 十 灾 () 罗 二 从 (7) 
g(V) = 从 册 :WV)T 灾 (2 二 297) 人) 十 位 (V) 
2) 二 从 ( 作 : 4() 十 灾 () 2 人 +T 穴 (7) 人 ) 上 + 办 人) 
The tree7 = \W has the subtrees displayed in Fig. 1.2. It contains symmetries in that 


the third and fourth subtrees are topologically equivalent, This explains the factor 2 
in the expression for the elementary weight. 


ww N: 
N\ 7 7 N\ 
NA 7 / 号 

素 ; 1 。 


Fig. 1.2. A tree with symmetry 





JIU.1.4 Composition of B-Series 


We now extend the above composition law to general B-Sseries, jl.e.，we insert the 
B-series themselves into each other, as Sketched in Fig. 1.3. This allows us to gen- 
eralize Lemma 1.9 (because 几 Fo) is a special B-series). 


0 自 介 /2 


Ta 


Fig. 1.3. Composition of B-series 


We start with an observation of Murua (See, e.g., Murua 儿 Sanz-Serna (1999)， 
p. 1083),namely that the proof of Lemma 1.9 remains the same ifthe function 屎 Fo) 
is replaced with any other function 六 9g(y); in this case (1.21) is replaced with 
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3 
pg(B(w 切 ) = 09 十 大 ago 9 二 al 中 人 及 十 可 
jsa(。)a(7)g (PP 记 放 二 


Such Series will reappear in Sect. IIU.3.1 below. Extending this idea further to, Say， 
jj (ulyv2)， where vl, ua are two fixed vectors, we obtain 


六 六 (B(awg)(onva) = 天 Po 二 Ma Fo 用 (1.30) 
+ ja 人 (oo 月 二 于 aa(27 one 六 月 
ja(,)jal7)7 oo 大 六 廊 二 


This idea will lead to a direct proof of the following theorem of Hairer 儿 WanneT 
(1974). 





Theorem 1.10. Zet w :TU{ 人 一 下 pea7apping satisping a( 人 = 1 and 1et 
:TU{ 从 一 玉 pea12ipay Ten tpe B-series 妃 (a,y) iserted into 已 (2 ) 
0G80a10 0 DB-Se1ies 


了 (六 B(w 切 ) = Bob (1.37) 


Were 1 如 e 81oUPD opeation ab(T) 1 a8 到 (7.3 人 革 e.， 


abr)= > ， bb:ar\o wii ar\g= To 4.38) 


0OcOST(7) OocTN\O 
Poof (a) Pn part (c) below we prove by induction on ||,2 Ethat 


丈 I 踢 玉 |7 了 | 
5 亲切 = 放 ; arNg) Fr)(y)， (1.39) 


(7;O)E4(2) (7D) 
Where 四 
4() 三 {(7,g) ;7TE7T0OEOS7T(T) ,0 = 人) 
Multiplying (1.39) by (9) and summing overal ET 三 yields the statement (1.37)- 


(1.38), because 
2 站 


OET (r,9)EA4(Y TET 9EOST(7) 


(pb) Choosing a different 0 of 7 in the deftinition of OS7(7) yields the 
Same Sum in (1.39). Therefore (1.39) ls equivalent to 


下 刀 Iwl 
RCIUaCy) = > -一 alwNb Fo， (40) 


CjiEpo 人) 


Where 四 
(9) ={(wog;iweo7oeOSsT(w);0 = 9)}， 


and (7T) is the number of orderings of the tree T, see (1.31). Functions defined on 
trees are naturally extended to ordered trees. mn (1.40) we use |w| 三 al(w) 王 


arylo) = zhalwN 人 =altrNb,andF(w)()= Fr)(y for 匹 = 
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(c) For 鼠 =。andw = (wwm)wehavea(w\0) 三 a(wi) :pa(wm) 让 
0 一 。. Since we have a one-to-one correspondence (w,0) 二 wbetween 12(。) and 
O7, and Since the expression in the Sum of (1.40) is Independent of the ordering of 
w, formula (1.40) is precisely Lemma 1.9. 

To prove (1.40) for a general tree 9 = |01 .2], we apply the idea put for- 
ward in (1.36) to 户 六 0) (B(a， 2) (ul ...,UD) with fxed ul,...,v，, and obtain as in 
the proof of Lemma 1.9 


pjJO(B(wJ))(0on so) = 》、 >》 


70 过 0 TIET mmE 人 


amTH+H1) ……a(THm) jim((o . mi 


户 


玉 |+1| 十 … 十 |m+m| 十 











an) an+Hm) 


Changing the Sums over trees to Sums Over ordered trees we obtain 


hj0O(Blaa)o = 工 语 工 … 


mm>0 ”wiiIEOT wiiHmEOT 


at arm 国人 En FCormG)， 


玉 |wzt+il 十 … 十 |wz+mm| 十 1 








A(wiH1) KRK(wiHm) 


We insert vi 一 刀 天 (0)(B(awy) intothis relation and we apply ourinduction 


hypothesis 





闷 125| 








|wjl 
次 (本 所有) 三 “古人 


ED 


We then use the recursive definitions of xc(2) and 天 (2)(y) on the leftrhand side. On 
the right-hand side we use the mnultilinearity of FU+m), the recursive definitions of 
lwl, Ar(w), FE(w)( forw = (wwlHm),andthe facts that 


Qa(wAN\ 由 三 QINO0) pa(wNO0) aoa(wiD) pa(wrm) 


and 


0 于/ 


(wl;01)E02(o91) (won)E2(O9) w+HIEOT w+HmEOT (w,0)EIOHm (2) 


Where H11,N2,.. .count equal trees among 91. 和， and fm(9) consists of 
those pairs (w,0) E 2(O) for which w is of the form w = (wwiHm). The 
factorials appear, because to every (! 十 人)-tuple of the leftr-hand sum correspond 
人 ) elements in fV24Hm( 信 ,obtained by permuting the order This yields 


fornu 信 全 40) and hence (1.39). 
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了 xample 1.11， The composition laws for the trees of order < 4 are 





ab(。) 一 以 信 af) 十 凡 *) 
ay ) = 六 屹 :oa ) 二 Wo) 二 DVD) 
ab(V) =O eV)+U) ao +2007) ao)+TAV) 
ui) -5 国 :o 加 二 区 oa(7)+ 区 :es 二 
ab = at + at T3000) ao 十 30V) al ) 
TPCAN) 


一 0 


和 


Remark 1.12. The composition law (1.38) can alternatively be obtained from the 
corresponding formula (1.34) for Runge-Kutta methods by using the fact that B- 
Sefries Which represent Runge-Kutta methods are “dense ”in the Space of all B-Sseries 
(See Theorem 306A of Butcher 1987). 


JIU.1.S The Butcher Group 





John C. Butcher， 
born: 31 March 1933 in Auckland 
(New Zealand) 


The composition law (1.38) can be turned into 
a 810OUD Operafio1, by introducing a Vit ele- 
111CE11 


e( 人 二 1，el7)=0 forre，(1.41) 


and by computing the pverse ele1aze11t of a 
given Q. This is obtained Tecursively from 
the table of Example 1.11，by requiring 
aar-1l(7) = 0 and by inserting the previously 
known values of ao-1( 人 . This gives for the 
first orders 


or1(.) = -al 

or1(7) = -oa(O)+a( 

TV) = -ae(V)+2a(7)a(.) 一 oo 
pri() = -a( 几 二 2a(7jJa() 一 起 
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We can distinguish several realizations of this group: 


CRK the set of Runge-Kutta schemes with composition (1.28); 

CEW the Set of elementary weights of Runge-Kutta Schemes with the composition 
law (1.34); 

GTM the set of tree mappings w :TU{ 信 一 下 satisfying a( 仿 = 工 with 
composition (1.38); 

GBS the set of B-series (1.23) satisfying a( 伯 = 1 with composition (1.37). 


Atechnical difficulty concerns the group CRK, where “reducible” Schemes must be 
identified (by deleting unnecessary stages or by combining Stages that give identical 
Iesuljts) to the same “irreducible” method (See Butcher (1972), or Butcher 多 Wanner 
(1996), p. 140). The deftinition of %(7) in Theorem 1.4 describes a group isomor- 
phism from CRK to CEW, further CEW is a subgroup of CTM and Theorem 1.10 
Shows that formula (1.23) constitutes a group homomorphism from CTM to CBS. 
Because the elementary differentials are independent (See, e.g.,， Hairer NUrsett 公 
Wanner (1993),，Exercise 4 of Sect. II.2), the last two groups are isSomorphic. The 
group CRK can also be extended by allowing“continuous”Runge-Kutta Schemes 
with “infinitely many stages”(See Butcher (1972), or Butcher 贸 Wanner (1990)， 
p.141). The term “Butcher group”was introduced by Hairer 儿 Wanner (1974). 
This paper tells the story of a mathematical object that was created by 


John Butcher in 1972 and was rediscovered by Alain Connes，Henri 
Moscovici and Dirk Kreimer in 1998. (Ch. Brouder 2004) 


Connection with Hopf Algebras and Quantum Field Theory. A surprising con- 
nection between Runge-Kutta theory and renormalization in quantum field theory 
has been discovered by Brouder (2000). One denotes by a Popj aligepra a graded 
algebra which, besides the usual product, also possesSes a cop1oduct atool used by 
H. Hopf(1941) 2 in his topological classification of certain manifolds. Hopf algebras 
generated by families of rooted trees proved to be extremely useful for Simplifying 
the intricate combinatorics of renormalization (Kreimer 1998). Kreimer's Hopf al- 
gebra 7 is the Space generated by linear combinations of families of rooted trees 
and the coproduct is a mapping 人 :他 一 戏 @Lwhichis,for the first trees, given 
b 
六 
AI1)=/7@l+.@.+lIQ@1/ 


A(V)=VYVoel+.。…oQ@.+2.@1/+1Q@YV 


A( 人 =yelryo.+er+le) 
It can be clearly seen, that this algebraic Structure ls precisely the one underlying 
the composition law of Example 1.11, so that the Butcher group CTM becomes the 
Corresponding cparacter 81oxP. The So-called aztipodes oftrees7T E 作 , denoted by 
9(7T)j, are for the first trees 


(1.43) 


2 Not to be confused with E. Hopf, the discoverer of the “Hopf bifurcation”. 


60 JI. Order Conditions, Trees and B-Series 


和 0.44 
5( 力 =-/+27 


and, apparently, describes the imverse eleze1p1t (1.42) in the Butcher group. 





IJIU.2 Order Conditions for Partitioned Runge-Kutta 
Methods 


Wenow apply the ideas ofthe previous Section to the creation of the order conditions 
for partitioned Runge-Kutta methods (1.2.2) of Sect. II.2. These results can then 
also be applied to Nystrom methods，. 


IIU.2.1 Bi-Coloured Trees and P-Series 
Let us conslider a partitioned System 


三 帮 0 2)， 之 一 9(2) (2.1D) 


(non-autonomous problems can be brought into this form by appending t = 1)， 
We Start by computing the derivatives of its exact Solution, which are to be inserted 
into the Taylor series expansion. By analogy with (1.4) we obtain in this case the 
derivatives of y at 10 as follows: 


= 上 

y = 态 / 二 jg (2.2) 
93) = 用 (六 旋 上 2 及 :9)+ 户 99) 二 方太 万 证 亡 gyj 二 万 9z9. 
Here, 万, 万 , 访 :denote partial derivatives and all terms are to be evaluated at 


(yo, 20). Similar expressions are obtained for the derivatives of z( 如 . 

The terms occurring in these expressions are again 
called the elemzentary djerenfials 天 (7)(V 2). For their 让 0 
graphical representation as a tree T7，we distingulish be- 
tween black” vertices for Tepresenting an 厂 and“white” 
vertices for a g. Upwards pointing branches represent par- 
tial derivatives，with respect to y 于 the branch leads to a 六 
black vertex, and with respect to z 让 it leads to a white 
Vertex. With thlis convention, the graph to the Tight corre- 
Sponds to the expression 万 ， (gwz( 凡 9), 丰 ) (See Table 2.1 for more examples). 

We denote by 7 the set of graphs obtained by the above procedure, and we 
call them (rooted) pi-colorvred treey. The first graphs are。and o. By analogy with 
Definition 1.1, we denote by 


yz 1 
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Table 2.1. Bi-coloured trees, elementary differentials, and coefficients 





7| 7 graph | at7) | Fr) | 73(r) g(7) al7) 
1 。 。 1 让 1 有 1 
2| [| 7 1 户 / 2 于 ,paii 1 
2 | [ov 8 1 广 9 2 开 ， 2505 1 
Vi1 john 3 | orbosae | 2 














3 [ 
3 | [ojy 2 | 户 z(jg) | 3 | 2biaiaike | 1 


3 |[oobl MI1 Psg)| 3 17ocrar | 2 












































3 | [jy 1 万 户 j 6 | 2Rioizoaiz | 工 
3 [ojyy / 1 万 户 9 6 2 六 DiaQijGj 1 
3 | [zj 》 1 jgvj 6 | 2 六 Diai | 1 
3 [[oj:y 》 1 jz=09z9 6 2 六 DiQ55G 和 1 
1 o o 1 9 1 并) 访 1 
2 [.]， ! 1 grv 2 于 ,pas 1 
etc etc etc etc 
| 辣 交 | 悄 and 区 人 
the bi-coloured trees obtained by connecting the roots of 九 ,. ..,Tm to anew Toot， 


Which is 。 in the first case, and e in the second. Furthermore,， we denote by 了 证 
and 7TP. the Subsets of 7 which are formed by trees with black and white roots， 
Tespectively. Hence, the trees of 7TP，correspond to derivatives of Vy( 轨 ，whereas 
those of 7TP: correspond to derivatives of z( 加 ， 

As in Deftinition 1.2 we denote the number of vertices of r E 7P by |rl, the 
order of r. The symmetry coefficient o(7) is again defned by 


and, for7 三 [7Tmyor7r = 三 [说 .Tomzby 
alT) 三 a(7) pa(Tm) :HUI (2.3) 
Where the integers /11, /2,... count equal trees among 九 ,...,Tr E 77. This is 


formally the same definition as in Sect. II.1. Observe, however, that a(7) depends 
on the colouring of the vertices. For example, we have o(V) = 2,but of(\) = 1 
By analogy with Definition 1.8 we have: 
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Definition 2.1 (P-Series). For a mapping a : TPU{00} 一 月 aseries of the 
form 


| 
ay+ 和 二 alr) Fr)lyn 


TETPy cf ) 


|7| 
algs)z+ 也 -alr) Pr) 


TET 了 PP- ct7) 





ls called a_P-syeriey. 


The following results correspond to Lemma 1.9 and formula (1.26). They are 
obtained in exactly the Same manner as the corresponding results for non-partitioned 
Runge-Kutta methods (Sect. II.1). We therefore omit their prootfs. 


Lemma 2.2. Lefta : TPU{00 一 玉 satisjh a( 加 ) =a(0>) 三 1.77pen 
JP(o (人 ) 
及 一 忆 la 儿 
(eee 功 )=P(eea 
mwjpere au (0 ) 一 ww(0) =0ao(s)=a(o) 三 1 and 
a(7) = a(T) pa(Tmn)， (2.4) 


矿 either7 三 [Tomply or7=[ Tam]: 














Theorem 2.3 (P-Series of Exact Solution). 77Pe exact solutiomn off2.1)1aP-series 
(y(to 十 jz(t 十 门 ) = Ple,(yo,20)), wheree(ly) =e(0>) = 1and 


el(7T) 一 Joral teE7P (2.9) 


Eee 
TUT) 


Were 1hpe TY(T) Pave 如 e sa1ie values ds Jor 11o11o-colot1ed treey. 














JU.2.2 Order Conditions for Partitioned Runge-utta Methods 
The next result corresponds to Theorem 1.4 and is a consequence of Lemma 2.2. 


Theorem 2.4 (P-Series of Numerical Solution). 7jpe 7z1zerical Solu1iomn of a Pa 六 
1ifio11ed Raz1ge-Kztta 111etpod ( 开 2.2)10P-seriey (2) 一 忆 ( 沪 ， (yo0， 20))， We7e 
90o) = %0z) = 1 and 


> -bbilr) jpor reT7P 
人 人 > bbi(r) Jor re7P. 

TJe expression 内 (7) bdejped 及 内 () = 由 (o)=1andzy 
六-=1aizxgix(T) 站 六 ET 


27=1 0a7jxg7x(TR) 矿 TK ETP。 
C.7) 


(2.0) 


几 (7T) 三 Wi(TD) iT 不 傣 ( 灰 ) 三 | 


源太 丰 过 全 有 和 玉 攻关 全 且 | 关 和 本 交 | 
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Pioof These formulas result from Lemma 2.2 by writing (ii 有 0) from the for- 
mulas (II.2.2) as a P-series (Pi hi) 一 忆 (i (yo， 20)) So that 


(六 >》， Qi 大 刀 》 25) 一 己 ( 泡 ， (yo， 20)) 
了 学 


is also a P-series. Observe that equation (2.0) corresponds to (1.16) (where gi has to 
be replaced with i) and that formula (2.7) comprises (1.13) and (1.19)，where we 
now Write Wi instead of ui;. 














The expressions 0%(7) are Shown in Table 2.1 for all trees in TP, up to order 
Ir| 和 3.A similar table must be added for trees in 7P:, where all roots are white 
and all bw are replaced with 亿 . The general rule is the following: attach to every 
vertex a Summation index. Then, the expression (7) is a sum over all summation 
indices with the summand being a product of 5 or 已 (depending on whether the 


66 199 


Toot “ii black or white) and of oj (让 “1” 府 black) or Cj (过 “1” 让 white), for 


66 99 


each vertex“[” directly above “7 


Theorem 2.S (Order Conditions). 4 Partitiomzed Raz1age-Kzxtta ethod (L2.2) 1as 
orderm ie 一 UV 二 癌 三 OrTD 2 一 2z( 加 十 内 二 OrTD iand om1y 太 


bg(7T) = 一 一 JorrETPUTP:w 矶 | 了 所 7 (2.8) 


Prooj This corresponds to Theorem 1.3 and is seen by comparing the expansions 
of Theorems 2.4 and 2.3. 














了 xample 2.6， We See that not only does every individual Runge-Kutta method have 
to be of order 7, but also the So-called coxPpling comditio1s between the coefficients 
of both methods must hold. The order conditions mentioned above (See formulas 


(1L.2.3) and (HU.2.3)) correspond to the trees 了 人 ,7 and / . For the tree Sketched 
below we obtain 


1 
> bi0i7QjmQinaikQkIQIaarakp 二 一 一 
0 Q 了 9 2 z 5 3 


人 17077257D5Q 


or by using > ; aij 一 ci and 下 ;7 一 





>》， 太公 二 一 We 人 1 
iCiQi7CTQiECEQKICI 一 mm 


2 


JU.2.3 Order Conditions for Nystrom Methods 


A“ “modern ”order theory for Nystrom methods (IIL.2.11) of Sect.I.2.3 was first 
given in 1976 by Hairer 包 Wanner (See 9ect. IJ.14 of Hairer Ngrsett 廊 Wanner 
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1993). Later it turned out that these conditions are obtained easily by applying the 
theory of partitioned Runge-Kutta methods to the System 


0 二 2 0 (2.9) 


which is of the form (2.1). This function has the partial derivative 万 = and all 
other derivatives of 太 are zero. As a consequence, many elementary differentials are 
Zero and the corresponding order conditions can be omitted. The only trees remain- 
ing are those for which 


“Dack vertices ave af 711osft oO11e 8O1 G11G 1118 SO1 11IUS1 De Write (2.10) 


了 xample 2.7， The tree Sketched below apparently satisfies condition (2.10) and the 
Corresponding order condition becomes, by ITheorem 2.4 and formula (2.8)， 


工 
13.12.4.3.2.4.3 


> pi0i7QjKQkmQkhnaGkpajioaorarsajlQttatuatu 一 





Due to property (2.10), each aik inslide the tree comes with a 
Corresponding Qk7 and by (2.10), both factors contract to an 77 7 炎 
0ij Similarly, the black root is only connected to one white 
vertex, the corresponding bi0zj simplifies to 0 We thus get 
万 一 刷 Rs 1 
>》， ICQJKECKECEQITqQ9qsQITECH 人 13 3456 
也 9;5 让 


Each of the above order conditions for atree in 7 has a “twin in 7Pz ofone 
order lower with the root cut off. For the above example this twin becomes 


工 
区 
》 D5QjK5CKCEQJaGasGitCt 一 3456: 
也 有 ;95 让 


We need only consider the trees in 7 大 让 


is Satisfied (See Lemma I.14.13 of Hairer Ngrsett 氏 Wanner (1993), Sect. II.14). 


Remark 2.8. Strictly speaking, the theory of partitioned methods is applicable to 
Nystrom methods only 让 the matrix (Qij) is invertible. However since we arrive at 
expansions with a fnite number of algebraic conditions, we can recover the Singular 
case by a continuous perturbation of the coefticients. 


上 quations without Friction. Although condition (2.10) already eliminates many 
order conditions, Nystrom methods for the general problem 冰 = 9(y,7) cannot be 
much better than an excellent Runge-Kutta method applied pairwise to System (2.9). 
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There 1, however an Important Special case where much more progress 1Ss posSsible， 
namely equations of the type 


也 三 9(y)， (2.11) 


which corresponds to motion without friction. In this case, the function for xz in (2.9) 
is Idepeden of z, and in addition to (2.10) we have a Second condition, namely 


“Pite verticesy ayve oly Diack so18 (2.12) 


Both conditions reduce the remaining trees drastically. Along each branch, there 
oOccur alternating black and white vertices. Ramifications only happen at white ver- 
tices. This case allows the construction of excellent numerical methods of high or- 
ders. For example, the following 13 trees 


了 


asSure order 9, whereas ordinary Runge-Kutta theory requires 17 conditions for this 
order See Hairer Ngrsett & Wanner (1993), pages 291f, for tables, examples and 
references. 


IJIU.3 Order Conditions for Composition Methods 


We have Seen in the preceding chapter that composition methods of arbitrarily high 
order can be obtained with the use of Theorem IIL.4.1. However as demonstrated in 
Fig.I.4.4, these methods are not attractive for high orders. This section is devoted 
to the derivation of order conditions,， which then allow the construction of optimal 
high order composition methods. 

The order conditions forthese methods are often derived via the Baker-Campbell- 
Hausdorff formula. This will be the Subject of Sect. IIU.3 below. Only very recently， 
Murua 儿 Sanz-Serna (1999) have found an elegant theory based on the idea of B- 
Series. This paper has largely inspired the Subsequent presentation. 


JU.3.1 Introduction 
The Principal tool in this section ls the Taylor Series expanslion 
环 ( 切 = 十 pda 人 十 大 oa( 纺 十 放 0a( 纺 十 … G.D) 


of the basic method. The only hypothesis which we require for this method is co7- 
Sisfe1zcy, 1.e., that 


qi(y) = 太 g). (3.2) 


All other functions qi(V) are arbitrary. 
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The underlying idea for obtaining the exzpansions for composition methods ls, in 
fact, very Simple: we just insert the series (3.1), with varying values of ,into itself. 
All our experience from 9%ect. HI.1.2 with the insertion of a B-series into a function 
will certainly be helpful. We demonstrate this for the case of the composition 同 三 
Zn ogguih Appliedto an initial value yo, this gives with (3.1) 


员 三 Eapn(yo)=yo 十 haidi(yo) 十 ja3zda(yo) 十 …. 


(3.3) 
2 三 San) 三 太 十 Paodi(W) 十 ja3da() 十 :…. 











We now insert the first series into the second, in the Same way as we did in (1.353). 
Then, for example, the term j12a3d2(y1) becomes 


轨 三 .十 12a2do(yo)+jNsa2aldg(yo)dai(yo) (3.4) 
4 


几 
十 Ma3ajidj(yo)dz(yo) 十 可 201d (Wo)(d (yo)， qi(yo)) 十 …. 


We see that we arrive at“generalized”B-series, wphere the elementary diffterentials 
contain not only o1me function，but are composed of ;njzzztely 1aa1y functions and 
their derivatives. We Symbolize the four terms written in (3.4) by the trees 


This leads us to the following definition. 





Definition 3.1 (co-Trees, 也 ~- -series). We extend Definitions 1.1 and 1.2 to 7T、， 
the setl of al 7ooted treesy Where eacj vertex pea1y qd Poysilive ;nieger without any 
further restrictton, and use the notation 
,OO,G@,...=thetrees with one vertex; 
[, ,Tri = thetree7r formedby anew root GO connected to 万 ，,Tmi; 
F(O)C = dy); 
Fr)(O = 丰 m( 人 (FE(m)( 人 ,Fnn)( 人 ) forr as above; 


lz| 三 工 十 | 刀 | 十 十 |m|， the number of vertices of 7; 
| = 十 上川 二 | 十 十 | 上， thesum ofthe labels of 7; 
alT) 三 HUilUal pa(7) oa(Tm)， 
Where 11 ,12;,.. .count equal trees among 九 ，...,Trmn， 


the sy1z11let1ry coe 太 cie1t reSspecting the labels; 
1(T) 一 12， the lapel oftpe roof of 7. 


Foramapa:TsU{ 从 一 及 we write 


|7|| 
Bo 人 =algy+ 和 alr)FC) 全 G.5) 


| 
TETco (7T) 


Which extends the 71ofiomz of B-syeriey to the new Situation. 
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卫 xample 3.2.，For the tree 


G@(o 9 
(909 
7T 一 QQg 今 7T= 四 ,mi where 厂 =@,，= (3.60) 
过 UW) 


wehave 
开 (T)OO= 吃 (dd (g(d5( 人 ,ad6(y)a6(y)) 


7T=[@O,I@,@,O@j74，1 了 =6 |r=29， cm) =2， 27) = 和. 
The above calculations for (3.4) are governed by the following lemma. 


Lemma 3.3. Foraseries B-(ay) wiipa( 仙 =1we jave 


Il 
ad(BeJ)= 习 和 全 


where aqw(O) = 三 1 azd 


a(7) = a( 站 ammn) Jor 7= | 万，Tn]i (3.8) 


Prooj This is a straightforward extension of Lemma 1.9 with exactly the Same 
prootf . 














The preceding lemma leads directly to the order conditions for composition 
methods. However, 让 we continue with compositions of the type (UL.4.1), we arrive 
at conditions without real solutions. We therefore turn to compositions including the 
adjoint method as well. 


JIU.3.2 The Ceneral Case 
As in (IIL.4.6), we conslider 

玩 王 Cospog@5jo.ogBoopogGpjhogonoGoih (3.9) 
and we obtain with the help of the above lemma the corresponding 妃 - -series. 
Lemma 3.4 (Recurrence Relations). 771e Jo11owipng co1mpositionsy a1e 已 -yeriey 


( 玫 ooGojo 6 站 GO) = Be(b 


(3.10) 
(Ba oo 20o.oEupnog5 hy 一 已 (ak 人 功 . 


Tjeir coefjcienls ae Jectrsively give py ak(0) 1 ok(O) 三 1 ao(7) 三 0Joralt 
T ET a1d 


只 (7) = ok-i(7) 一 (一 了 Ji 用 (站 )， 


2 (3.11) 
ak(T) = 可 (r) 十 ai) 及 (7 
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Pioof The coefficients ao(7) correspond to the identity map B-(ao,y) = 人 The 
Second formula of (3.11) follows from 


Bo 人 = Go 人 (Be 人 ) = Be 人 + ad 人 (Be 人) 


?之 工 


and from an application of Lemma 3.3. 

The relation 妃 - (py) 三 瑟 3 (B。 (Qk 1 2)， which involves the adjoint 
method, needs a little trick: we write it as 局-(ap 1V) = Gen (B-(Dk， 2)) 
(remember that 27 一 G-7)， apply Lemma 3.3 again, and reverse the formula. This 
gives the first equation of (3.11). 














Adding the equations of (3.11), we get 
ok(7) = ok-i(7) + (oaR 7 一 (一 BJC) 及 (7 G.12) 


Because of 风 (@O) = 1 we obtain 


用 关 示 

天 一 1 天 大 G.13) 
MDO)=> oo 一 >》(-6 = >》 (o-(-00， 

《一 1 一 站 二 二 


The fact that, for 姑 (@O),the sum of (一 5o): is from 1 to ,but the sum of ay is only 
from 1 to 8 一 1, has been idicated py a Prime attached to the summation Symbol. 
Continuing to apply the formulas (3.11) and (3.12) to more and more complicated 
treeS, we qulickly understand the general rule for the coefficients of an arbitrary tree. 


了 xample 3.S. The tree7 in (3.6) gives 


S 开 
"QQ JR onD=> (of- 史 > (oo+ 
4Q 当 m 认 人 ， 6G.4 
< t+ TUI(> os- 有 罗 ) 
入 三 六 好 一 工 2 一 1 


The Order Conditions. The exact solution of y = jjV) is a 已 -series y(t 十 万 ) 三 
刀 (e,yo) (See (1.26)). Since dl(y) = /UV), every 已 -series is also a -series with 
e(7T) = 0 for trees with at least one label different from 1. Therefore, we also have 
4( 如 十 由 = 下-(e,yo), where the coefficients e(7) satisfy e(@O) = 1,e(7) 王 0 让 
iT) > 1, and 


一 一 e(7 站 :el(Tm) for 7 = [万 .，,Tm]1. (3.1S) 
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Theorem 3.6. 77Pe comaposition 1aetjpod 用 (y) = B-(asy) of(3.9) pas orderD 太 
as(T) = e(7) Jor rETe wy 下 E72. (3.16) 


Proof This follows from a comparison of the 已 ~--series for the numerical and the 
exact Solution. For the necessity of (3.16), the independence of the elementary dif- 
ferentials has to be studied as in Exercise 3. 














IIU.3.3 Reduction of the Order Conditions 


The order conditions of the foregoing section are indeed beautiful, put for the mo- 
ment they are not of much use, because of the enormous number of trees in 7 of 
a certain order. For example, there are 166 trees in 7 with |Ir|| 和 6. Fortunately， 
the equations are not all independent, as we Shall see now. 


Definition 3.7 (Butcher 1972, Murua & Sanz-Serna 1999). For two trees in T、， 
4 三 [ua .umjiandu 三 [ul 访 we denote 


ou := 一 [ayUWmy ui ， 4 Xu 一 [Urol oliH (3.17) 


and call them the Butcper prodxcf and 111e18108 Poduct TeSpectively (See Fig. 3.1). 


QPp 中 
知 QODO on ae en 见 en 
Gy ge Gf 
QQ oo ea Di QQ PDL 
四 者 四 凶 李 
亿 亿 2 X 7 


OU VOL 


Fig. 3.1. The Butcher product and the merging product 


The mersging product is associative and commnutative, the Butcher product is 
neither of the two. To Simplify the notation，we write products of several factors 
without parentheses, when we mean evaluation from left to right: 


让 
ouliovo...ous=(((uoui)ov)o...)ovs， (3.18) 


Here the factors v1,...,Vws can be freely permuted. 

All subsequent results concern properties of ak(7) as well as (7T), valid for all 
K. To avoid writing all formulas twice, we replace ak(7T) and 办 (7T) everywhere by 
aneutral Symbol c(7). 
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Lemma 3.8 (Switching Lemma)， 411 ax pk or Zea1aa 3.4 Satish Jor a11 ,VE 
Tt1pe yelatiom 


cl(uou) 十 cl(vuow)=cl(uwj cu) 一 c(u xD)， (3.19) 
Proof The recursion formulas (3.11) are of the form 
al7T) 一 5(7) 十 atr) W(7). (3.20) 


We arrange this formula, for all five trees of Fig. 3.1, as follows: 








aoo) 十 aluou 十 a(& XI 一 aa(u) 
一 bov) 十 buoz) 十 box) 一 bb) 
十 aitOW(uou) 十 aio)(uouw) 十 aito+io)b(u xD 


一 atto)N()b(o) 一 as) 一 astoaito)o(DB(u) ， 


Because of Wu oow) 三 1 太 (b) and yu xwu) 三 以 (wo), the last two Tows 
cancel, hence 


a(7T) satisfies (3.19) 令 pb(7) Satisfies (3.19). (3.21) 











Thus, beginning with ao, then bi, then al, etc., all ak and of must Satisfy (3.19). 





The Switching Lemma 3.8 reduces considerably the number of order conditions. 
Since the right-hand expression involves only trees with |7| < | o v|l, and since 
relation (3.19) is also satisfied by e(7), an induction argument shows that the order 
conditions (3.16) for the trees ovwandwvowareequivalent. The operation ov 一 
ouwconsists Simply in Switching the root from one vertex to the next. By repeating 
this argument， we see that we can freely move the root inslide the graph, and of all 
these trees, only one needs to be retained. For order 6, for example, there remain 68 
conditions out of the original 166. 

Our next Tesults Show how relation (3.19) also generates a consliderable amount 
of reductions of the order conditions. These ideas (for the Special Situation of Sym- 
plectic methods) have already been exploited by Calvo 儿 Hairer (19953b). 


Lemma 3.9. 4ssz1ae 1 妨 at a11 外 of Le11a1110 3.4 Sa1isjy da 7elafiomn or ipe Jo771 


TV 72 
> 4 Tec)=0 (3.22) 
i=1 =1 
Wi al1702i > 0.71e1 Jorany tree Ga ak ad pb Sa1sA 1pe 7elatiom 
和 
>》， 4icl(wouwitouw2o.. .ouimi) 三 0. (3.23) 


池 直 
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Prooj The relation (3.20), wIitten for the tree Wo Wilo Wi2o...otUimiyls 


aoailo .om ) 三 (olo. omy) 


十 at)U(w)B(ui) Du ). 


Multiplying with 4; and summing over ?2 this shows that, under the hypothesis 
(3.22) for b, the relation (3.23) holds for bp 过 and only if it holds for w. The coef- 
ficients ao(7) = 0 for the identity map satisfy (3.22) and (3.23) because mi > 0. 
Starting from this, we again conclude (3.23) recursively for all ak and 耻 . 














The following lemma 3 extends formula (3.19) to the case of several factors. 
Lemma 3.10，For amy 态 7ee 1reey WU QUL ak bp OFLe1a1110 3.4 Sa11sj da 17elatio7m 
cl(uovowW) 二 clwouwot)+clwouwoo=clG cj cuw) 十 .… ， (3.24) 


Where tpe dotfs indicate alizear co1pination ofproduct Ti co) wii lo 十 lvz| 十 
.<| 可 十 ol+|wl anad jor eacpn 1erma at 1east one of tpe vj possessey Gd 1ape1 


La1ger 如 a1z o1e. 7Tje ge1le1al Jorpatia, jor7mm treeyu) .WUm， 1 
7 7 
》 cluouo ouiiouitio. oum) =[eos) +… (3.25) 
2 1 一 1 


Prooj We apply Lemma 3.9 to (3.19) and obtain 
c(wo (ou)) 二 clwo(lvot)=c(wouwoW) 一 co(uxo). (3.26) 
Next, we apply the Switching Lemma 3.8 to the trees to the left and get 


c(wo (oo)) 二 cuovow)=clGoj :cuoW) 一 cx(oo)) 





ct(wo (woua)) 十 cowomu)=clGwoj: cuo 妇 一 c(wx(uoa)). 
Adding these formulas and Subtracting (3.26) gives 
c(uovouw)+cwouwow)+cwouwoo)=cluo)(cluou) 二 ceo) 十 … 


which becomes (3.24) after another use of the Switching Lemma. Thereby, every- 
thing which goes into “十 ...”contains Somewhere a merging product,， whose roots 
introduce necessarily labels larger than one. 

Continuing like this, we get Tecursively (3.23) for all 77. 














In order that the further Simplifications do not turn into chaos, we fx, once and 
for all, a iolal oader 7elation (written <) on 7<-，where we only require that the 
order respects the number of vertices, ie., that 


4 < ， whenever |v| < |. (3.27) 


Similar to the Strategy introduced by Hall (1930) for simplifying bracket expresSions 
in Lie algebras, we define the following subset of 7、. 


3 due to A. Murua, Private communication, Feb. 2001 
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Definition 3.11 (Hall Seb). The 玉 c11 set corresponding to an order relation (3.27) 
is a subset 杂 C 7 defined by 


GEMIMH for 一 123,..， 
TEH 今 thereexist vcE7tV>VU,Suchthat T 王 人 o1. 


了 xample 3.12. The trees in the subsequent table are ordered from left to right with 
Tespect to |7r|,，and from top to bottom within fxed |7|. There remain finally 22 
conditions for order 6. 


A Hall set7t with ||r|| < 6: Not in It are, for example: 

四 CDD 
由 2 cc 和 pa because =v = ; 

CU 
蚂 和 & because 人 is not in /; 
0 

g@ Cn 

全 人 

总 


四 间 
四 加 4 because 4 一 ls not in 7 ; 
J CGO) 


G@ 0 0 


Theorem 3.13 (Murua & Sanz-Serna 1999). For eac17T E Te 1pere ae comsta1aty 
Li inatege1y 70i G11d 117eey 2 ET SUCH IILJDr al ak pk orZLea11a 34 We jayve 


四 由 日 口 
四 





V Ti 
c(7) 一 >》， 4 [[ c(a 让 )， 2i7 所 7 | RS | 区 | 了 |. (3.28) 
人 


Pioof We proceed by induction on |7|. Forr = @ the statement is trivial, because 
Q@O Et. Wethus considerT ET with |7| 之 2, writeitas7 了 一 Wov,andconclude 
through the following two steps. 


Eiz7rst tep. We apply the induction hypothesis (3.28) to v, 1.e.， 
co = > Beo)， ws io5l<lo， (3.29) 
? 了 
To this, we apply Lemma 3.9 followed by the Switching Lemma 3.8: 
c(T) = c(wovu) 一 六 Bicluovilovi2. ooim) 


2 


一 》 Bicloinso(uooao: ooim iD) 十 …: : 


| 
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The“ 十 ...”indicate terms containing trees to which we can apply our induction 
hypothesis. Inslide the above expressions, we apply the induction hypothesis to the 
trees Wo Vilo...ouUini-l,followed once again by Lemma 3.9. We arrive at a huge 
double sum which constitutes a linear combination of expressions of the form 


cl(ua ouWao...o un) (3.30) 


and of terms“ 十 ...”covered by the induction hypothesis. The point of the above 
dodges was io 11Qke SU1e 1at Ga11 UVU2..UWm G1e 态 放 . 


9eco1d 9tep. It remains to reduce an expression (3.30) to the form required by 
(3.28). The trees 2,...,Wm can be permuted arbitrarily; we arrange them in in- 
creasing order ua2 挟 ... 所 mn 

Case 1. 坟 ui > wa2, then by definition wi o ua = WE 7 and we absorb the 
Second factor into the first and obtain a product woWaso...ouwm With Jewer factors. 

Case2. 芝 ui < au 所. ,weshuffe the factors with the help of Lemma 3.10 
and obtain for (3.30) the expression 


-> cluuiomo:.)+TTeco)+… 
这 2 生 1 


With the first terms we Teturn to Case 1, the Second term is precisely as in (3.28)， 
and the terms “十 ...are covered by the induction hypothesis. 

Case 了 Now letwl = < ... .Inthis case,the formula (3.25) of Lemma 3.10 
contains the term (3.30) twice. We group both together, so that (3.30) becomes 


77 


7 
工 下 
2 cuiouomo…)+ 了 [coo)+… 


4 1 一 | 


and We go back to Case 1. Ithe first 如 7ee trees are equal，we group three equal 
termas together and So on. 

The whole reduction process ls repeated until all Butcher products have disap- 
peared. 














Theorem 3.14 (Murua & Sanz-Serna 1999). 7T7je comapositiomn 1aietpod 败 (y) 王 
已 (as,y) of(3.9) as orderD 太 anad omy 太 


as(T) =e(T) jjorrE1w|rl<7D. 
7Tjpe coe 太 cienty e(T) da1e 加 ose of 7Tjpeore1z 3.6. 


Proof Wehave seen in Sect.I.4 that composition methods of arbitrarily high order 
exist. Since the coefficients 4; of (3.28) do not depend on the mapping c(7), this 
together with Theorem 3.6 implies that the relation (3.28) is also Satisfied by the 
mapping e for the exact solution. This proves the statement. 
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了 xample 3.1S$. The order conditions for orders D = 1 ...,4become, with the trees 
of Example 3.12 and the rule of (3.14), as follows: 


S 


Order 1: OO >》 (ak 十 太 ) 三 


Order 2: @ >》 (o 一 C2) =0 


Order 3: >》 (oo 十 CD) 三 0 
>》 (oo 用 az 十 Co) = (3.31) 

三 =-1 

Order 4: >》 (ak 一 DO =0 


大 
(中 十 大 ) 2 ae 十 Do) = 


> (o -人 (oo+ 肝 ) =0 


大 一 生计 


ee ee e ee e 
了 





Where, as above, a _Pp7imie attached to a Summation Symbol indicates that the Sum of 
ol is only from 1l to 丰 一 1, whereas the Sum of (一 Go); is from 1 to K. Similarly, the 
remaining trees of Example 3.12 with ||r|| = 5 and |Ir|l| = 6 give the additional 
conditions for order 9 and 0. 

We Shall see in Sect.V.3 how further reductions and numerical values are ob- 
tained under various assumptions of Symmetry. 


JU.3.4 Order Conditions for Splitting Methods 


Splitting methods, introduced in Sect. JI.3, are based on differential equations of the 
form 


= 万 (y) 十 户 ()， (3.32) 
[1] [2] 


Where the fows pf and of ofthe systemsy = 万 (y) andy = 户 (W) are assumed 
to be known exactly. In this Situation, the method 


二 


is of first order and, together with its adjoint 27 三 o 问 O o， can be used as the 
basic method in the composition (3.9). This yields 


人 (3.33) 


Where 
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一 o 十 应 ， ai 一 ai-1 十 应 (3.34) 


with the conventions ao = 0 and OH1 = 0. Consequently, the Splitting method 
(3.33) ls a Special case of (3.9) and we have the following obvious result. 


Theorem 3.16. UDPose 1Paf 1Pe co1iposition 11etpod (3.9) 1 of order D Jor a1 
Basic 1ietpods GOj ten 加 e SPDUi1i8 11etpod (3.33) wii ai boi give Dy (3.34) 1 oF 
1 太 e Sa11e oO1der 1. 














We now want to establish the reciprocal result. To every consistent Splitting 
method (3.33), i.e.，wWith coefficients satisfying > ;ai 三 >》 ;Wi 三 1, there exist 
unique oi; 0 such that (3.34) holds. Does the corresponding composition method 
have the Same order? 


Theorem 3.17. 矿 a co1msiste1t SD1i18 1etpod (3.33) 1 of ojder D at 1east Jor 
Piople1as of tpe Jo (3.32) wii 1Pe ntegrapie SPU08 


ER Po0=( 0] where 三 小 G.35) 


1je1 1je coj7esPDo11d1118 coO11DOSI1iO1 1Ietpod jhas 1jhe Sa111e order D Jor an arpitrary 
Dasic 11etpod ). 


Proof McLachlan (1995) proves this result in the setting of Lie algebras. We give 
here aproof using the tools of this section. 

a) The fows corresponding to the two vector fields 万 and 户 of (3.35) are 
o 昌 (V) = 十 t 态 (y) and o 户 (W 一 十 t 户 (W), respectively. Consequently, the 


method 2) 一 oh Oo o 问 can be written in the form (3.1) with 


看 殷 = 记功 十 户 加 不作 = 丰 j(( 户 罗 户 轨 )，G39 


The idea is to constructb for every tree7 E 1, functions 91(y2) and qz(W1) such that 
the first component of 忆 (7)(0) is non-zero whereas the first component of 忆 (c)(0) 
vanishes for all oc E 7 different from 7. This construction will be explained in 
part (b) below. Since the local error of the composition method is a 已 ~ -series with 
coefficients as(7) 一 e(7)j, this implies that the order conditions for 7 E Mt with 
|rl gp are necessary already for this very Special class of problems. Theorem 3.14 
thus proves the statement. 

b) For the construction ofthe functions gl1(y2) and gz(V1) we have to understand 
the Structure of 忆 (7)(y) with dk(y) given by (3.36). Consider for example the tree 
7T E Ts of Fig.3.2, for which we have F(7)(y) = 地 ()(di(y),ds( 人 ). Inserting 
ak(Vy) from (3.36), we getby Leibniz rule a linear combination of eight expressions 


Ge {l,2)) 


化 ( 户 , 户 下 ( 户 , 户 ))， 信访 户 扒 ( 户 , 户 ))， 
人 ( 访 , 访 扩 ( 户 , 户 ))， 及 (万 , 扩 ( 户 , 户 ))， 
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WP 


Fig. 3.2. Trees for illustrating the equivalence of the order conditions between composition 
and splitting methods 


each of which can be lidentifed with a bi-coloured tree (See Sect. IIU.2.1，a vertex 
。 Coresponds to 广 and o to 户 ). The trees corresponding to these expressions 
with = 1 are shown in Fig.3.2. Due to the special form of ds(y) in (3.36) and 
due to the fact that in trees of the Hall set 刀 the vertex CD can appear only at the 
end of a branch, there ls always at least one bi-coloured tree where the vertices 。 
are Separated by those of 。and Vice versa. We now Select Such a tree, denoted by 
了, and we label the black and white vertices with {1,2,...} We then let Wi 三 
(2 and yo 一 (32)7 ,where m and rn are the numbers of vertices 
。and o in To, reSpectively. Inspired by “Exercise 4 ”of Hairer NUrsett 多 WanneT 
(1993), page 155, we define the ;th component of gli(yz) as the product of all 区 
where 7 runs through the labels of the vertices directly above the vertex 。 with 
label ?. The function gz(Wi) is deftined similarly. For the example of Fig. 3.2, the tree 


有 孔 yields 
本 节 到 码 
go) = | 吧 | ， 92(1) 三 1 
1 1 


One can check that with this construction the bi-coloured tree 7 ls the only one 
for which the first component of the elementary differential evaluated at y 三 0 is 
different from zero. This in turn implies that among all trees of 7 only the tree 了 
has a non-vanishing first component in its elementary differential. 














Necessity of Negative Steps for Higher Order One notices that all the compo- 
Sition methods (1H.4.6) of oder higher than two with 2Bj given by (II.5.7) lead to a 
Splitting (1IL.3.6) where at least one of the coefficients ai and wm ls negative. This 
may be undesirable, especially when the How o 辣 originates from a partial differen- 
tial equation that ls il-posed for negative time progression. The following result has 
been proved independently by Sheng (1989) and Suzuki (1991) (See also Goldman 


久 Kaper (1996)). We present the elegant proof found by Blanes 儿 Casas (2005). 


Theorem 3.18. 矿 ize splitting metphod (1L5.6) is oforderpDp >3Jorgeneral jl and 
JP tjen afleastone oftjpe ai aad atleast one of1pe ia1e strictiy negalive. 


Proof The condition in equation (3.31) for the tree @) reads 


S S 十 工 
> ( 嗓 +B=0 orakgo >》( 吧 十 习 )=0 
虹 二 息 天 一 | 


(remember that ao = 0 and 0。 1 = 0). Now apply the factthat z3 十 3 < 0 implies 
Z 十 <0andconclude with formulas (3.34). 
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JIU.4 The Baker-Campbell-Hausdorff Formula 


This section treats the Baker-Campbell-Hausdorff (Short BCH or CBH) formula on 
the composition of exponentials. It was proposed in 1898 by J.E. Campbell and 
proved independently by Baker (1905) and Hausdorff (1906). This formula w 记 
provide an alternative approach to the order conditions of composition (Sect. 1I.4) 
and splitting methods (Sect.I.3). For its derivation we Shall use the inverse of the 
derivative of the exponential function. 


JIU.4.1 Derivative of the 匡 xponential and Its Inverse 


Elegant formulas for the derivative of exp and for its Inverse can be obtained by 
the use of matrix commnutators [12,4] = 04 -400. I we suppose 12 fxed, this 
expression defines a linear operator 4 一 [1 人 2, 4 


adp(4) 二 [2, 4， (4.1) 
which is called the adjoint operator (See Varadarajan (1974), Sect.2.13). Let us start 
by computing the derivatives of DA&. The product rule for differentiation becomes 


(二 人 4) 如 一 末 PA-1+ CHFPD82 十 .十 CDp-I 万 ， (4.2) 


and this equals 大 末 V2A -1 话 P and 万 commnute. Therefore, it is natural to write 
(4.2) as 大 万 BR-1L to which are added correction termas involving commnutators and 
iterated commnutators. In the cases 上 一 2and8 三 3wehave 


DTFO+COT = 2FD+adp( 万 ) 
厅 D22+OPFDP+22 厅 = 3 厅 22+3(adp( 万 )2+ad2( 万 )， 


where ad denotes the iterated application of the linear operator ad p. With the 
convention ad0 (万 ) = 瓦 we obtain by induction on K that 


天 一 工 


斤 四 呈 Ge 


i 一 0 
This is seen by applying Leibniz rule to Op+1L 一 人 .Iandbyusingtheidentity 
(adp( 媚 )) = (adp(D)2+ad7 (万 )， 
Lemma 4.1. 77e derivative of exp 92 三 > .kx>o0 十 4 1given pby 
( e 92) 妞 = (de (B))e 人 2 
IP7 XP 二 XPD XP "“， 
WAje1e 了 
dexpp( 万 ] = 》 一 一 ad?( 万 ). 4.4 
pp( 已 ) 人 代 二 可 2( 巧 ) (4.4) 


71e seriey (4.4) co1yefgesy Jor all 11aatricey (12. 
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Proof Multiplying (4.3) by (k0)-- and summing, then exchanging the sums and 
putting7 三 太一 2 一 1yields 


(二 exp 2) 古 2 记 1 癌 (aap( 本 )25 和 


> Di (aap( 且 )27， 


这 0 7J>0 


| 


The convergence of the series follows from the boundedness of the linear operator 
adp (wehave lade2l 么 22|)， 














Lemma 4.2 (Baker 190S). JriPe elige1values of ipe Linear operator ad p ae dijhe 六 














e1t jjo1zz 2b7 Wi 太 LE { 士 1 十 2, .小 ten dexpp 1 ipvertiple. Furtpermaiore， We 
jpave 名 or 2 < Ttpat 
一 1 E 有 
dexpP ( 刀 ) = ad9( 万 )， (4.5) 


Where Bk ae 1pe Bermotli7aaabers dejiaed py > 0(CBk/EDz = /er 一 1 


Proof The eigenvalues of dexpp are = > 0oXe/(E 十 HL = (ex 一 1)/N， 
where Ais an eligenvalue of ad po. By our assumption, the values / are non-Zero, SO 
that wexpp is invertible. By deftinition of the Bernoulli numbers, the composition of 
(4.5) with (4.4) gives the identity. Convergence for | 22|| < rrfolows from |lad cl 和 
2|2|| and from the fact that the radius of convergence of the series for z/(ez 一 ]) 
1$s 2T， 














IIU.4.2 The BCH Formula 


Let 4 and B betwoarbitrary (in general non-commuting) matrices. The problem is 
to fnd a matrix C( 力 , such that 


exp(t4)exp(t) = exp C(. (4.0) 


In orderto getafirstidea ofthe form of C(t, we develop the expression to the left in 
aseries: exp(t4) exp(t) = Tt 4 二 万) 十 生 (42 十 24 有 二 B23)TO(B) =: T 二 X. 
For sufficiently small t (hence | Xis small), the series expansion of the logarithm 
log(T 十 X) 王 和 一 X2/12 十 ...yields amatrix C 人 (为 一 log(T 十 X) 一 克 4 十 万 ) 十 
马 (42 十 24 妃 十 2 - (4 十 万 )2) 十 O(b), which satisfies (4.6). This series has 
apositive radius of convergence, because it js obtained by elementary operations of 
Convergent SerieS， 

The main problem of the derivation of the BCH formula is to get explicit for- 
mulas for the coefficients of the series for C( 坟 ,and to express the coefficients of 
妇女 ,interms ofcommutators. With the help ofthe following lemma, recurrence 
relations for these coef6cients will be obtained, which allow for an easy computa- 
tion of the first terms， 
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John Edward Campbell4 Henry Frederick Baker Felix Hausdorff 


Lemma 4.3. ZLef 4 azd 已 pe (pzo1z-co11a11IU1i28) 11G11ices. 7T1em (4.6) 1o1ds， wpere 
CU stPesoiution oftpe djerential equation 





ade(4 十 万 ) (4.7) 


wii initialyvalue C(0) = 0. Recalltpatadc4= [IC4=C4-4C,andtpat BA 
de11ote 1pe Bermoxl UPDery ay 0 Le1111110 4.2. 


Proof Wefollow Varadarajan (1974), Sect.2.13, and we consider for small s and t 
a Smooth matrix function 2G(s, 妨 such that 


exp(s4)exp(t 妞 ) = exp 2G(s, 国 . (4.8) 


Using Lemma 4.1, the derivative of (4.8) with respect to s 1S 


DZ 
4exp(s4)exp(tB) = qdexpzks (去 Ge ) exp 2G(s, 刘 ， 


So that pz 芝 
三 大志 E | 开 天 
训 =dexpz (=4- 了 [2 本- 汪 二 ad%(4)， (4.9) 





We next take the inverse of (4.8) 


4 和 John Edward CampbellL born: 27 May 1862 in Lisburn, Co Antrim (Ireland), died: 1 Oc- 
tober 1924 in Oxford (England). 

5 Henry Frederick Baker born: 3 July 1866 in Cambridge (England), died: 17 March 1956 
in Cambridge. 

5 Felix Hausdorff, born: 8 November 1869 in Breslau, Silesia (now Wioclaw, Poland), died: 
26 January 1942 in Bonn (Germany). 
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exp( 一 上 B)exp(-s4) = exp( 一 2Z(s))， 
and differentiate this relation with respect to 志 As above we get 


OZ 工 万 开 
如 一 dexp-y(B) 一 巨 二 了 [本 + > 页 ad 和 (有 )， (4.10) 


有 >2 


because ad&z( 刀 ) = (-1*ad 儿 (B) and the Bernoulli numbers satisfy Bk = 0 
for odd > 2. A comparison of (4.6) with (4.8) gives C(b 三 2( 坊 . The stated 
differential equation for C() therefore follows from C( 一 芝 伟 四 十 尝 他 人 ， 
and from adding the relations (4.9) and (4.10). 














Using Lemma 4.3 we can compnute the first Taylor coefficients of C ( 芒 ， 
exp(t4)jexp(tB) = exp(tC， 十 2Co 十 bcCe 二 HLC 二 EC 十. | (4.11) 


Inserting this expansion of C(b into (4.7) and comparing like powers of + gives 





Cl = 4+ 妃 
C。 = 了 4 一 B,4+ 可 = 了 [4 可 
C = 4-B35L4 可 = 瑟 |4L4 可 + 吾 |B,IB,4d] 
二 盖世 吉 |4， [B,[B,4]| (4.12) 
G = = 一 击 |4 革 可 | - 志 |BEB,[B,E 划 ]] 
-两 |4EB,[B,IB, 4 + 南 [ 世 LLL4 可 ]| 
+ |4 [BEB, 4 + 喜 |, EL 可]|. 








Here, the dots ... in the formulas for Ca and C5 indicate simplifications with the 
help of the Jacobi identity 


L4,[EB,C] + [C,L4 相 +[B,IC,4] = 0， (4.13) 


which is verified by straightforward calculation. For higher order the expresslions 
Soon become very complicated. 

The Symmetric BCH Formula。For the construction of symmetric Splitting meth- 
ods it is convenient to use a formula for the composition 


exp( 354) exp(t 已 ) exp(34) 二 exp(t91 十 妈 53 十 妈 95 十. (4.14) 


Since the inverse of the leftrhand side is obtained by changing the Sign of 刀 the 
Same must be true for the right-hand side. This explains why only odd powers of 
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t are present in (4.14). Applying the BCH formula (4.11) to exp(#4) exp(#B3) = 
exp C 人 (加 and a second time to exp(C ( 雪 ) exp( 一 C( 一 为) yields for the coefficients 
of (4.14) (Yoshida 1990) 





o1 = 4+ 了 

吕 皇 -去 |4 革 可 + 二 |B,B,4d 

5 = 4 册 贡 由 可 - 南 | 有 [BEB,[B,IB4| ca 
+ 沿 |4[B,[B,IB, 人 | + 击 | LI 可]| 
-高 凡 4[B, 世 可 + 遍 [ 世 EB[44 可 ]] 


IIS Order Conditions via the BCH Formula 


Using the BCH formula we Present an alternative approach to the order conditions 
of Splitting and composition methods. The main idea is to write the fow of a differ- 
ential equation formally as the exponential of the Lie derivative. 


IIU.S.1 Calculus of Lie Derivatives 


Eor a differential equation 


= JI() +7D()， 


it ls convenient to study the composition of the 


fows om and 2 加 of the Systemas 


= JW)， yy=jDW)， (6.0 


TeSpectively. We introduce the differential op- 
erators (ZLie derivative) 


中、 
= 》 方 U) 二 一 
人 了 7 ( ) DV 





which means that for differentiable functions 
环 :了 ”一 及 mm wehave Wolfgang Grobner” 


DiF (人 = 已 (7 人， G.2) 
It follows from the chain rule that, for the solutions oo 加 (yo) of (5.1)， 


7” Wolfgang Gr6bner born: 11 February 1899 in Gossensass, South Tyrol now Italy), died: 
10 August 1980 in Innsbruck. 
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Flepoo) = (DiP(egGoo)， G53) 


and applying this operator lteratively we get 


总 所 = (DYF)(Pi(oo))， G5.4) 


Consequently, the Taylor series of 忆 (e (yo))， developed at 上 = 0, becomes 


开 
Fegoo) = 半 厂 (DYPGo) = emp(tDD)FGo)， 63) 
>0 


Now, putting 天 (y) = Id(y) = W the identity map, this is the Taylor series of the 
Solution itself 


帮 

oo) = 2 厅 (DTd() = exp(tPiIalyo) G.0) 
KE>0 

If the functions 7 加 (y) are not analytic, but only V-times continuously differen- 

tiable, the series (3.6) has to be truncated and a O(j ) remainder term has to be 

included. 


Lemma S$.1 (Grobner 1900). Zei 2 G1C 2 加 pe 1jpejiows oripe di1rerential equa- 
fionsz = (andzy = jl(, yespectively Fortpeir composition we 1per jpave 


(% o 9 ) (yo) = exp(sDi) exp(tD?) Id(yo)， 


Pioof This is precisely formula (5.3) with 王 1,treplaced with s, and with 严 (y) 三 
D] 


Pi () = exp(tDoz)Id(yo)， 
Remark S.2. Notice that the indices 1 and 2 as well as s and t to the left and right 
in the identity of Lemma 5.1 are permuted. Grobner calls this phenomenon，which 
Sometimes leads to Some confusion in the literature, the“Vertauschungssatz”. 














Remark S.3. The statement of Lemma 3$.1 can be extended to more than two flows. 
If er is the flow of a differential equation my = 7jD(y),then we have 


(9 DR 9 有 ) (yo) =exp(sDi) exp(tD2) exp(uDnmjId(yo)， 
This follows by induction on 77. 


In general, the two operators Pi and 忆 。 do not commnute, So that the composi- 
tion exp(LDi)exp(tDz)Id(yo) is different from exp(t( Di + Da))Id(yo) ,which 
represents the solution or(yo) of 六 = jy) = JW + BDI(). The relation of 
Lemma 3.1 Suggests the use of the BCH formula. However，Di and D2 are un- 
bounded differential operators So that the Series expansions that appear cannot be 
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expected to converge. A formal application of the BCH formula with 14 and 1 
TIeplaced with sD1 and t+Da, respectively, yields 


exp(sDi)exp(tD?) = exp(D(s, 国 )， (S.7) 


Where the differential operator 刀 (s, 旭 is obtained from (4.11) as 


上 过 
D(s = 5D1 十 1D2 十 本 [Di Pa] 二 五 | [D;, Da]| 
(3.8) 
t2 272 
下 [>，， [pz, Dj]| 十 2 | [D>， [Da, Di]| 十 .….， 
The Zie pracket for differential operators is calculated exactly as for matrices， 


namely, [Di, Dao] = DiD。 - DaD1. But how can we interpret (3.7) Trigorously? 
Expanding both Sides in Taylor series we See that 


exp(sDijexp(tD) = TsDiTtD) 二 3 (s2D?+2stDiDs + 记 D3 上 十. (5.9) 
and 


exp(D(s) = 工 +D(s 刘 十 了 D(s,? 十 .…. 





工 
一 了 十 5D1 十 tD2 十 了 ((sP; 十 tD2)? 十 st[ 盖 1， ]) 总 


By derivation of the BCH formula we have a formal identity, i.e., both series have 
exactly the Same coefficients. Moreover, every finite truncation of the series can be 
applied without any diffculties to sufficiently differentiable functions 严 (y). Con- 
Sequently, for NW-times differentiable functions the relation (5.7) holds true, 寺 both 
sides are replaced by their truncated Taylor series andifa O(Ah ) remainderis added 
(由 = 王 max(|s|,|t))， 


JU.S.2 Lie Brackets and Commnutativity 


开 we apply Pa to a function 玉 , followed by an application of 忆 ， we will obtain 
partial derivatives of Fof first and second orders. However 直 we Subtract from this 
the Same expression with Di and D2 reversed, the Second derivatives will cancel 
(this was already remarked upon by Jacobi (1862), p.39:“differentialia partialia 
Secunda functionis 厂 non continere”) and we see that the Lie bracket 





aj 8 站] 8 
[Di Daz| = DiDa - DaDi = 证 | 地 沪 | (3.10) 
了 了 7 7 


is again a linear differential operator So, from two vector fields Fi and 7 we 
obtain a 1Pirdl vector field 1. 
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The geo1aetric 1Iea11118 of the new Vector 站 
field can be deduced from Lemma 5.1. We See 回 Ys 加 
by Subtracting (5.9) from itself, once as it Stands 2 2 
and once with sDP1 and iD permuted, that Vo dg 加 
Bl 9s 


ep 有 op-eDoo 晤 Go) = st[Di Da Idlyo) 上 + = 革 J 国 (go) 二 (5.1) 


(See the picture), where “十 . .areterms oforder > 3.This leads us to the following 
TeSult. 


Lemma S$.4. Zer H(y) azd FD(y) pe dejined om an open set 7Tjhe correspomding 
Jows 2 
矿 


G1C OP CO11111IU1e eVe1yWAeje Jorall suficienty s11als ad 志 矿 azzd om 


[Di, Do] = 0. (5.12) 


Proojf The “only 让” part is clear from (3.11). For proving the“ if” part we take 5 


andtfxed, and Subdivide, for a given m, the integration intervals into 7 equidistant 


parts As = s/m and Al = tn. This allows us to transform the solution 2 加 O 


2 (yo) by a discrete homotopy in mn2 steps into the solution 2 O 2 加 (yo), each 
time appending a small rectangle of size O(m 一 2). If we denote such an intermediate 
stage by 

二 00) 
IT2At 12 人 As 71At 11As 


then we have 70 一 2P ]o。 2 (yo) and 六 va 三 一 oO ]o。 oOP 1(o0) (See Fig.S.1). Now, for 


7 一 oo, We have 是 Re 





IPT 一 人 IE On 一 3)， 


because the error terms in (9.11) are of order 3 at least, and because of the dif- 
ferentiability of the solutions with respect to initial values. Thus, by the triangle 
inequality | 刀 > 一 感 | 和 OO( -0D and the result is proved. 


























Fig. S$.1. Estimation of commnuting solutions 
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IIU.S.3 Splitting Methods 


We follow the approach of Yoshida (1990) for obtaining the order conditions of 
Splitting methods (1H.3.0). The idea is the following: with the use of Lemma 5.1 we 
write the method as a product of exponentials, then we apply formally the Baker- 
Campbell-Hausdorff formula to get one exponential of a series in powers of 岂 . Fi- 
nally，we compare this series with (Di 十 D2)，which corresponds to the exact 
Solution of (9.1). 

The splitting method (II.5.6), Viz.， 


了 一 2 加 ) 9 20 ， Oo 02 问 六 9 .9 eD Oo po 加 Oo oD， (5.13) 


Ta 7 一 


is a composition of expresSlons 国庆 O 0 ) Which, by Lemma 5.1 and by (35.7), can 
be written as an exponential 


oo Ph 一 exp (c jEH1 十 思 7E2 十 oj 所 7 


上 9 入 天 硬 十 四 相同 本 十 o3 好 于 至 十 …)Id， 
Where we use the abbreviations 
村 =Di， 胱 =Do， 开 = iDu Da 到 = 瑟 [Di， [Di, Da]|， 
磁 = 下 [DIDa, Dj， 弄 = 支 [Dilps,[Dx, DJ， 
and the dots indicate O(15) expressions. 
We next define 丈 O) recursively by 
三 天 (5.15) 


so that 歼 (m) is equal to our method (5.13). Aiming to write 灭 (5) also as an exponen- 
tial of differential operators, we are confronted with computing commnutators of the 
exXpressions 忆 ;. We see that [BE1, 本 ] 一 21 [PH BE = 6B, [BE1 = 一 6 
[BE 本 ] = 2P4 and [ 功 ,a3 = 一 2B1 as a consequence of the Jacobi identity 
(4.13). But the other commnutators cannot be expressed in terms of 五 ? . We therefore 
introduce 


工 
2 = 元 [D)， [Di [Di pa， 有 瑟 = 元 [D2， [D2,[D2, D]]|. 
This allows us to formujate the following result. 
Lemma S.S. 77pe 1ze1pod 区 0 dejiped py (5.15) ca pe Jomaally written as 
罗 0) 一 exp{( 守 和 本 十 号 六 三 十 吧 妥 醒 十 车 ) 妥 本 


上 @ 妥 醒 二 cj 风格 十 响 用 本 二 史 j 央 三 十]Id， 
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Where al coejicie11s afe ze1o Jor7 =0, ad whperejor7 之 1 

















Cl 全 cl- 十 07， C 二 cj-1 十 07， 
C1 至 C1 7 十 707 十 cl 1D C 7 -107， 
CG; 一 c 1 十 oo 十 2cij 1aj05 一 3ctj105 
十 (ci 1)2 区 学 ci j 1G5 -105 下 C 7 1023， 
CC 一 C 六 下 oj 了 5 5 4c2 j 107g 和 3c1 ) 105 
十 (@ 和 1)2a 本 ci 1G5 -10 开 ci j 102， 


G11C SI1I1G1 DUL 1O1e CO11DUicated Jo771ULas Jor CC 罗 


Pioof Duetothereversed orderin Lemma $.1 we have to compute exp(4) exp( 忆 )， 
where 4 is the argument of the exponential for 到 0-1) and Bis that of (5.14). The 
Iest is atedious but straightforward application of the BCH formula. One has to use 
repeatedly the formulas for [ 媚 ; , !], stated before Lemma 5.5. 














Theorem S.6. 77e SPLitti128 111etpod (5.73) 58 of oraderD 矿 
证 庆 和 2 凤 生计 5 cpgmm 一 0 Jor 开 =2,...,D anda1 (5.16) 


7je coe 态 cien1s 人 G1e 态 ose dejijzed z0 Le1121110 3.3. 


Prooj This is an immediate consequence of Lemma 3$.3, because the conditions of 
order Dp imply that the Taylor series expansion of 亚 Cm)(oo0) coincides with that of 
the solution px(yo) = exp(P(Di + Da))yo up to terms of size O(P2)， 














A Simplification in the order conditions arises for Symmetric methods (5.13)， 
that js, for coefficients satisfying wm = aiando 三 oforal;z(andpm 三 0). 
By Theorem II.3.2, it is sufficient to consider the order conditions ($.16) for odd 天 
only. 


JU.S.4 Composition Methods 
We now consider composition methods (IIL.4.6), viZ.， 


压 三 BospogB5ho.oG37oEaihnoS5i7 (3.17) 


Where Zhis a firstrorder method fory = 帮 V) and Gy is its adjoint. We assume 
三 一 exp(hCrTM2C2+NaCs+…)Id (5.18) 


with differential operators Cr, and Such that Ca is the Lie derivative operator cor- 
Iesponding to 7 = JW). For the splitting method G7 王 o 问 o o 昌 this follows 
from ($.14), and for general one-step methods this is a consequence of Sect.IX.1 on 


backward error analysis. The adjoint method then satisfies 
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exp (PC 一 12cs 二 jacs 一 .Id (5.19) 


From now on the procedure is similar to that of Sect.IIL5.3. We define 歼 O) recur- 
Sively by 
灭 (0) 一 Id， 玫 O) 一 区 9)， (5.20) 


So that 灭 (m) becomes (5.17). We apply the BCH formula to obtain 


Go exp( BC ER 上 ) exp(ojnCi 卡 克 有 G 十 54 Ja 
= erp((w+D)P 本 +( 星 -克己 梧 


1 
+(o 十 肥大 机 十 5050j(az 十 万 ) 居 本 十 … )Ia 


where 
一 Cr， 本 = [CCo]. 
We then have the following result. 


Lemma S.7. 77e 1ze1pod 区 0) off5.20) cazz pe Jomraally written as 


克 O) 一 exp (5 可 强 帮 训 下 光 而 吉本。 Id 








Where al coejicie1a1s afe ze1o Jor7 = 0, ad wherejory7y 三 1 70 
?1 重 许 产 1 十 oj 十 
?; 二 弃 产 1 下 ad < 0 
3 3 3 3 
17 二 7-1 下 了 南 万 
1 工 
位 ) 一 从 六 1 十 3azBj(ag 十 DJ) 十 1 一 B) 一 5 十 所 7)， 





Proof Similar to Lemma 39.3, the result follows using the BCH formula. 











Theorem S.8. 77Pe cozposifio1 11etpod (3.77) 1 oforderDp 太 


放 训 =1， mm=0 Jor 天 2 .Danda1 人 (5.21) 








THje coe 态 ciem1s 论 六 G1e 1jose dejized ri ZLe11a11a0 3.7/. 








It is interesting to See how these order conditions are related to those obtained 
with the use of trees. The conditions 订 m 三 1and 位 m 三 这 训 0areidentical 
to the first three order conditions of Example 3.15. The remaining condition for 
order 3, 谨 一 0, reads 


7 


770 7T0 天 一 1 天 一 工 
akBk(akr 十 太 ) 十 oa 一 (az 十 态 ) 一 ak 十 ) 》 (oz -02 
人 ) (ox 人 2 

二 革 


大 一 1 大 一 1 一 开 一 | 


0 天 IT70 开 
= >》( 吧 一 B>》 (ao++D)-》 (on 二 DBp) y o 一 语 ) = 
4 4E 


大 一 工 
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This condition ls just the difference of the order conditions for the trees Oo C 〇 D and 
GD o @, whose Sum is zero by the Switching Lemma 3.8. Therefore the condition 
刘 ，= 0is equivalent to (though more complicated than) the fourth condition of 
Example 3.15. 


Symmetric Composition of Symmetric Methods. Consider now a composition 
且 三 Co.ogophogpnogoopho. oh (9.22) 


where Oh ls asSymmetric method that can be written as 
6 二 exp 人 (18， 十 jaSs 上 1555 十 . JIa 


with 91 the Lie derivative operator corresponding to y = JW For the Strang 


Splitting 2 一 人 O o 问 O 人 Such an expanslion follows from the Symmetric 


BCH formula (4.14), and for general Symmetric one-step methods from Sect. IX.2. 
The derivation of the order conditions is similar to the above with 到 (7) defined by 


丈 () 一 更 7， 殉 O) 一 瑟 o 歼 0 一 Do 
so that 歼 (m) becomes (9$.22). 
Lemma 5$.9. 77e 1mretjpod 亚 O) caz pe Jormaaly written as 
GD) 一 exp(ajjP 本 十 oj 由 醒 十 o 由 本 十 o8 由 三 十 .Id 


Where 下 一 碟 =|951151,53]j, andwpereofil= 放 ai =0and 





2 

01j 三 01-1 十 209 

5 5 1/ 31 2 | 3 2 3 4 1 

22 二 02J-1 了 下 了 (3 (al 六 一 ai-1017-1 一 711 十 译 叶 站， 


Prooj The result is a consequence of the Symmetric BCH formula (4.14) with 
Tjj51 十 73135s 十 :andotlj ijABEI 十 oz) 1PEE 二 .intheroles of34and 
t 已 , respectively. 














Theorem S.10. 77Pe comzposifio1zz 1etpod (5.22) 1 oforderD 太 


一 :| ap = 一 0 Jorodd 大 三 3,...,D Gd 01 L. (35.23) 











THje coe 态 ciem1s GQ1e 太 hose dejijzed 胡 Le111110 3.9. 





Symmetric composition methods up to order 10 will be constructed and dis- 
cussed in Sect. V.3. 
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III.6 了 kXercises 


1. Find all trees of orders 5 and 6. 
2. (A.Cayley 1857). Denote the number of trees of order dg by ao. Prove that 


ai 十 02 十 ao 十 ad 十 :一代 一 人 (1 一 2 一 (1 一 2 9 ， 





0 1 2 3 
ac 1 1 2 4 9 20 43 15 286 719 


苍 
= 了 T 
Oo 
< 
请 
5 











3. Independency of the elementary differentials: Show that for every7T E 了 there 
is a system (1.1) such that the first component of (7)(0) equals 1, and the first 
component of F()(0) is zero for all trees 公 天 二 
万 Consider a monotonic labelling of r, and deftine yi as the product over all 
Vi Where 7J runs through all labels of vertices that lie directly above the vertex 
“2 For the first labelling of the tree of Exercise 4 this would be Vi1 三 V273， 
V2 三 1 9as 三 V4, and V4 三 工 . 

4. Prove that the coefficient aw(7) of Defin- 


ition 1.21s equalto thenumber of posSi- 4 4 3 
ble monotonic labellings of the vertices 3 2 2 3 2 4 
of 7，starting with the label 1 for the 


root. For example, the tree |[,],。]| has 
three different monotonic labellings. 
In addition, deduce, from (1.22), the recurslon formula 


amD=( 之。 je am 


| 刻 |,. 2 | 


0 


Where the integers HU11, /12,.. .count equal trees among 九 ,...,Trn and 


全 人 有 全 本 玫 
| 二 ssl 需 齐 | 石上 上 .|Tzl! 


denotes the multinomial coefficient. 
RenaaK In the theoretical physics literature, the coefficients aw(7) are written 
CAM(7T) and callegd“Connes-Moscovici weights”. 

S. 开 we denote by V(7) the number of elements in OS7(7), then show that 


NV(。)=2， NV(m 7Tn)=1+N(m) .NOrn). 


Use this result to compnute the number of subtrees of the christmas tree decorat- 
ing formula (1.34). 471zswer- 6805. 

606. Prove that the elementary differentials for partitioned problems are indepen- 
dent. For a given tree (7 E 7TP, find a problem (2.1) such that a certain compo- 
nent of F(7)(p,9) vanishes for all w E 7TP except for 7. 

ZL Consider the construction of Exercise 3, and define the partitioning of V 
into (p, q) according to the colours of the vertices. 


90 


居 


10. 


11. 
12. 


JI. Order Conditions, Trees and B-Series 


The number of order conditions for partitioned Runge-Kutta methods (II.2.2) 
ls 2ay for order 7, where ar ls given by (See Hairer Ngrsett 多 Wanner (1993)， 
page 311) 





人 工 2 3 4 5 6 下 8 9 10 
Qr 1] 2 7 2 107 458 2058 9498 44987 216598 











Find a formula similar to that of Exerclse 2. 


. For the special second order differential equation 内 = g(y) , and for a Nystrom 


method 
4 二 g( 十 cij 加 十 妇 》， ob) 
人 (6.2) 
V1 王 go 十 jgo 十 岂 》 Fi g = 三 加 十 刀 》 已 b 
i1 5 


consider the Simplifying assumption 


CR 


CNV(T) : > ， icp 2 一 一 2... 
(7) 邱 K( 开 一 1) )》 ) 71， 
S 交 
C7 C7 1 
站 N(C) : > 世 cp-2a， 一 D 了 了 大 一 2 ...,(C. 
(9) 人 二 让 2 





Prove that if the quadrature formula (bwici) is of order p, 让 = wu 一 ci) 
for all i, and ifthe simplifying assumptions CN(T), DN(G) are satisfied with 
27 十 2 之 pand6 十 97 二 2D,thentheNystrom method has order 7. 


.ANys11ro1a 1ietpods of 11aaxi114L order 25. Prove that there exists a one-parameter 


family of s-stage Nystrom methods (6.2) for 纪 = 9( 办 ,which have order 2s. 
ZL Consider the Gaussian quadrature formula and define the coefficients ai 
by CN(s) and by 


与 天 1 
Dicg2ais 三 号 ( 2 宇 
六 人 


for 大 一 2,...，5. 

Prove that the _ coefficient C4 in the series (4.11) of the Baker-Campbell- 
Hausdorff formula is given by C4 = [4,[B,LB,4] /24. 

Prove that the series (4.11) converges for | 直 < mm27(4+IEBI)， 

By ITheorem 5.10 four order conditions have to be Satisfied Such that the Sym- 
Imetric composition method ($.22) ls of order 6. Prove that these conditions are 
equivalent to the four conditions of Example V.3.13. (Care has to be taken due 
to the different meaning of the 7 ) 





Chapter 工 V. 
Conservation of First Integrals and Methods 
on Manifolds 


This chapter deals with the conservation of invariants (first integrals) by numerical 
methods, and with numerical methods for differential equations on manifolds. Our 
investigation will follow two directions. We first investigate which of the methods 
introduced in Chap.II conserve invariants automatically. We Shall see that most of 
them conserve linear invariants, a few of them quadratic invariants，and none of 
them conserves cubic or general nonlinear invariants. We then construct new classes 
of methods, which are adapted to known invariants and whpich force the numerical 
Solution to Satisfy them. In particular，we study projection methods and methods 
based on local coordinates of the manifold defined by the invariants. We discuss 
in Some detail the case where the manifold is a Lie group. Finally，we consider 
differential equations on manifolds with orthogonality constraints, which often arise 
in numerical linear algebra. 


TIV.1 了 上 xamples of First Integrals 


Je nomme intEgrale une 6quation 人 = Comst. telle que Sa diffkrentielle 
au 一 0 Soit vErifiee identiquement par le Systeme des Equations diffkren- 
tielles proposekes . . . (C.G.J. Jacobi 1840, p. 350) 


We consider differential equations 


2 三 jy)， (1L.1) 
Where V ls a vector Or posSibly a matrix. 


Definition 1.1.A non-constant function 7(y) is called a jizst integral of (1.1) 让 


7P(J)7( 人 =0 forally. (1.2) 


This implies that every Solution y( 划 of (1.1D) satisfies T(y(b) = To) = Const. 
Synonymously with“first integral ,the terms IOVvGFIG11L OFT CO118erved GUG1EEY OF 
CoO11S1G121 Of 11ofiom are also Used. 


In Chap.Iwe have Seen many examples of differential equations with invariants. 
For example, the Lotka-Volterra problem (L.1.1) has 7(u,u) = In 一 愉 十 2Inv 一 v 
as first integral. The pendulum equation (I.1.13) has 瓦 (p, q) = 22/2 一 cosq, andthe 
Kepler problem (I.2.2) has two first integrals, namely 妃 and 志 of (1.2.3) and (1.2.4). 
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了 xample 1.2 (Conservyvation of the Total Energy). Hamiltonian Systems are of the 
form 


六 三 一 瑟 v(D, 9)， d= 万 p(D,9)， 
where 万 v = VJ 万 = (aE/aq) 7 and 刀 一 Vp 万 = (9E/ap)7 are the column 
vectors of partial derivatives. The Hamiltonian function 达 (pz,q) is a first integral. 
This follows at once from 万 "(Pp, qd) 三 (85/an， 9 万 /6d) and 





2 | 和 2 
op、 bd 00\、pp， 

了 xample 1.3 (Conservation of the Total Linear and Angular Momentum of 
和 -Body Systems). We consider a System of N particles interacting pairwise with 


potential forces which depend on the distances of the particles. This is formulated 
as a Hamiltonian System with total energy (].4.1), ViZ.， 


1 TV 1 IN 1 一 1 
Epo 人 = 也 元 下 pi+ 呈 六 用人 -ol 
1 一 1 1 一 2 7 一 1 


Here di, pi E 下 3 represent the position and momentum of the ith particle of mass 
mi and Vi(r) (> 7is the interaction potential between the ;th and 7jth particle. 
The equations of motion Tead 








] 和 
禾 一 人 翩 一》 1 (9 g7) 
2 7 一 1 
Where, for1 > J, wehave Zij 三 Zi 一 一 Yi) with ri 三 ||w 一 gj and ii is 


arbitrary, Say Zii 三 0. The conservation of the total /iear 71o11te1ztz11a 书 三 3 25 


and the C18ULG7r 111O11e111U111 隐 一 渤 二 di X Di 1l$ a consequence of the Symmetry 
Ielation Zij 三 Zi: 


Q 

二 2 一 > Vviz(g dj) 一 0 
1 一 1 i=1 7 一 1 

1 二 1 N  N 

五 2 0x 疡 =2 三 xD 9x 呆 (全 一 提 王 0 
1 一 1 i1 1 一 1 7 一 1 


了 xample 1.4 (Conservyation of Mass in Chemical Reactions). Suppose that three 
Substances 4, 妃 , C undergo a chemical reaction such asl 


4 -0 了 (slow) 
芭 

也 + 二 00，、，C+ 上 + (very fastb) 
4 

5+O 了 卫 ，4+C (fas0). 








1 This Ropertsom proplem is very popular in testing codes for stiff differential equations. 
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We denote the masses (or concentrations) of the substances 4, 已 , C by V1, y2, 3， 
TIespectively. By the mass action law this leads to the equations 


人 太一 一 0.04 十 10492y3 
B: 如一 0.04 册 一 104goya 一 3.1077 
全 如 三 3.107 1 


We see that 册 十 加 十 轨 =0,hence thetotal mass TVy) = 十 十 ya is an 
invariant of the System. 


As was noted by Shampine (1986),， such linear invariants are generally con- 
Served by numerical integrators. 


Theorem 1.S (Conservyvation of Linear Invariants). 411 explicit G1G 7Z72D1icit 
民 Z118e- 开 11G 10e 妨 0ods co11Se1Ve Lea1 10VG1IG11、 Pa7Vitio1ed RU108e 一 KUHG 10e 纺 - 
ods (L2.2) co1yerve 1inear inyaria1nts 太 bi 三 记 Joral2 or 太 ije ivariant depe1dsy 
Oo111y Oo.D Or oly o11 9. 


Proof Let T(W) = dy with a constant vector qd, so that dx jy) = 0 for all y. 
In the case of Runge-Kutta methods we thus have dz 1i 王 0, and consequently 
民 人 1 三 吧 y 二 jadI( | 访 ) 一 dyo.The statement forpartitioned methods is 
proved Similarly. 














Next we consider differential equations of the form 
立 = 4(Z) 闷 (1.3) 


where Y can be avector or amatrix (not necesSarily a Square matrix). We then have 
the following result. 


Theorem 1.6. 矿 4(7) 8 skew-symaaetric jpor al Y (ie 47 = 一 4) 1Pen the 
GuUad1atic jzactiom T(7) 一 YY ia iovariant Particulaz 访 e 六 让 G1value 功 
CO11S1S18 Of o1tjo1mo71IQL col111118 (1.e.， 区 2 一 几 妨 e1a tje colu11a111y of ipe SoLutio7 
划 of(07.3) 7e1aaizz ortpomormaal Jr al 


Pioof The derivative of T(Y) 和 了 (2) 囊 = 7 万 十 万?7Y. Thus，we have 
站) = 了 了 (204(0)) =Y74() 了 TY74(7)TY foralyY which van- 
ishes, because 4( 了 ) is skew-symmetric. This proves the statement. 














上 xample 1.7 (Rigid Body). The motion of a free rigid body, whose centre of mass 
is at the origin, is described by the Euler equations 


1 一 alyaya， al = WU2 一 73)/U273) 
Q27371)， Q2 一 (73 至 石 )/(73 厂 ) (1.4) 
1a 一 aay1y2， a = (三 一 7)]OD) 


22 
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where the vector % = (Wiy2,ys)7 represents the angular momentum in the 
body frame, and 7, 172, ya are the Principal moments of inertia (Euler (1738b); see 
Sect. VIIL.3 for a detailed description. This problem can be written as 


人 1 0 V3/7T3 ”一 2/72 41 
2 | =| -%/ 0 %1/ 石 2 | ， (1.5) 
13 /7 一 几 / 石 0 13 


which is of the form (1.3) with a skew-symmetric matrix 4( 疡 ). By Theorem 1.6， 
好 十 邮 十 ianinvariant. A second quadratic invariant is 





17/ 儿 多 级 
Era) = 了 人 | 天 | ， 


Which represents the kinetic energy. 

JInspired by the cover page of Marsden 儿 Ratiu (1999),， we Present in Fig.1.1 
the Sphere with Some of the solutions of (1.4) corresponding to 站 二 2,712 三 1 
and 13 = 2/3. They lie on the intersection of the Sphere with the ellipsoid given 
by 瑟 (my2,y3) = Const. In the left picture we have included the numerical so- 
lution (30 steps) obtained by the implicit midpoint rule with step Size 玫 三 0.3 and 
initial value yo = (cos(1.1),0,sin(1.1)). It stays exactly on a solution curve. This 
follows from the fact that the implicit midpoint rule preserves quadratic invariants 
exactly (Sect.IV.2). 

For the explicit Euler method (right picture of Fig. 1.1,，320 steps with 慷 三 
0.05 and the Same initial value) we see that the numerical Solution Shows a wrong 
quajitative behaviour (it should lie on a closed curve). The numerical solution even 
drifts away from the Sphere. 


NA 
和 
和 
入 
志 
人 





Fig. 1.1. Solutions of the Euler equations (1.4) for the rigid body 
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IV.2 Quadratic Invariants 


Quadratic invariants appear often in applications. Examples are the conservation 
law of angular momentum in NW-body Systems (Example 1.3), the two invariants of 
the rigid body motion (Example 1.7), and the invariant Y7Y of Theorem 1.6. We 
therefore consider differential equations (1.1) and quadratic functions 


QUO) = 光 Cy， C.D 


where C is a symmetric square matrix. It is an invariant of (1.1D) 这 CA) = 0 
for all y. 


TIV2.1 Runge-Kutta Methods 


We Shall give a complete characterization of Runge-Kutta methods which automati- 
cally conserve all quadratic invariants. We first of all consider the Gauss collocation 
methods. 


Theorem 2.1.， 77Pe Caxss 11etpods of secf 下 1.3 (coliocafiozz pased om 1Pe sjijfied 
Zege1ad1e Poly1io11l1aLS) Co118e1Vve 9UGG1G1iC 10VG11G1215. 


Poof Let vv(b be the collocation polynomial of the Gauss methods (Defini- 
tion IL.1.3). Since 区 Qu 人 的) = 2u(07Ca(b, it follows from vw(to) = yo and 
ul(to 十 尹 ) 一 Wai that 


to 十 尺 
oOCwW 一 凤 Cyo =2 站 ut 工 CE 人 dt (2.2) 
to 


The integrand v( 轨 7Cuw(b is a polynomial of degree 2s -- 1, which is integrated 
without error by the s-stage Gaussian quadrature formula. It therefore follows from 
the collocation condition 


uto 十 cij7Cw(to 十 ci 由) 一 ul(to 十 cij)7Cj(u(t 十 ci 站)) 一 0 











that the integral in (2.2) vanishes. 





Since the implicit midpoint rule ls the Special case s 二 1 of the Gauss methods， 
the preceding theorem explains its good behaviour for the rigid body Simulation in 
Fig 1.1. 


Theorem 2.2 (Cooper 1987). 矿 1Pe coejzicientis ofa Range-KUita 11etjpod yatisj， 
Diai7 十 D5Q5 一 g Jor GL 全 / 一 1， 5) (2.3) 
态 ez 这 co1nserves guUad1afic iva7iarats.? 


2 For irreducible methods, the conditions of Theorem 2.2 and Theorem 2.4 are also neces- 
Sary for the conservation of all quadratic invariants. This follows from the discussion in 
Sect. YI.7.3. 
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Prooj The proof is the same as that for B-stability, given Independently by Burrage 
信 Butcher and Crouzelix in 1979 (See Hairer 氏 Wanner (19906), Sect.IV.12). 
The relation Wi = yo 十 几 六 ;1 wki of Definition IIL.1.1 yields 


好 Ci = 册 Cgo 上 + 》 大 达 Coyo+》 放 多 CM 二 忆 》 成 帮 条 CC2.4) 
1 一 1 了 一 1 让 7 一 1 
We then write 语 王 帮 斑 ) with 到 = yo 十 六 2 -1 aijk7j. The main idea is to 
compute yo from this relation and to insert it into the central expresSlions of (2.4). 
This yields (using the symmetry of C) 
凶 CWi = 多 Co 十 于 》 放 卫 CH 十 妥 》 (0 一 bi 一 态 oji) 多 C1 
1 一 1 27 一 1 

The condition (2.3) together with the assumption VC = 0, which states that 
COCW is an invariant of (1.1), imply 池 Cw = 网 Cy0o. 














The criterion (2.3) ls very Testrictive. One finds that among all collocation and 
discontinuous collocation methods (Definition HU.1.7) only the Gauss methods Sat- 
isfy this criterion (Exercise 0). On the other hand, it is possible to construct other 
high-order Runge-Kutta methods satisfying (2.3). The key for such a construction is 
the 作 -transformation (See Hairer 久 Wanner (19906), Sect.IV.3$), which is exploited 
in the articles of Sun (1993a) and Hairer & Leone (2000). 


TIV.2.2 Partitioned Runge- 下 utta Methods 


We hext consider partitioned Runge-Kutta methods for systems 7 一 02)， 
2 一 9g(y2. Usually such methods cannot conserve general quadratic invariants 
(Exercise 4). We therefore concentrate on quadratic invariants of the form 


Q@(/z)= 风 Dz， (2.5) 


where Dis a matrix of the appropriate dimensions. Observe that the angular mo- 
mentum of NW-body Systems (Example 1.3) is of this form. 


Theorem 2.3 (Sun 1993b). 71e ZLopatio 1114 -70B pair co1zlservey QU ggUad1atic 
11VG11G11S Of 1je 1jo1111 (2.5). .110 DaQ11icUaF 1118 1 117rUe Jortpe 91O1r11e/ 一 WeFiet scne111e 
(See 9ecf 112.2) 


Poof Letu(b and uv(b be the (discontinuous) collocation polynomials of the Lo- 
batto IIA and Lobatto IIIB methods, respectively (See Sect. II.2.2). In analogy to 
the proof of Theorem 2.1 we have 


Qu( 如 上 + 加 ,ut 如 十 内 ) 一 Qulto),o(to)) 
to 十 彤 
= / (Ga(D,o(D)+Q(ue(D,a(D)) @ 


to 


(2.0) 
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Since ut) is of degree s and v( 切 of degree s 一 2, the integrand of (2.6) is a poly- 
nomial of degree 2s5 一 3. Hence, an application of the Lobatto quadrature yields the 
exact Iesult. Using the fact that CQ(y, >) is an invariant of the differential equation， 
ie,Q( az) +TQ(y9(a) 三 0, we thus obtain for the integral in (2.6) 


PiQ@(ulto),6(t 如 )) +PpsQ@(u(to 十 站 ,56( 加 十 六)， 


where 6 信 一 2 一 g(u(b,u( 坟 ) denotes the defect, It now follows from vw(to) 一 
Vo, u(t 如 十 几 ) = 人员 (definition of Lobatto IIA) and from v( 如 ) = 20 一 RD10(to)， 
ulto 十 由 三 十 ]10s0(to 十 尹 (definition of Lobatto IIB) that Q( 2) 一 
Q(yo, z0) = 0, which proves the theorem. 














Exchanging the role of the IIA and IIJB methods also leads to an integrator 
that preserves quadratic invariants of the form (2.3). The following characterization 
extends TIheorem 2.2 to partitioned Runge-Kutta methods. 


Theorem 2.4. 矿 1Pe coejicie11sy of a Partifioned Rnge-Kx1ia 11etjpod (11.2.2) Sat- 
1 

5 十 态 oj = 人 Jor 5， C.7) 
二 页 Jori 一 1 s， (2.8) 


1Pe1 it Co118e1Vesy GUQd1aQ1iC IIVGFIG1H1S OF1pe Jo71a (2.2). 
太 1pe partiftioned di1jerential equation is oftjpe specialjory = 帮 2=9()， 
加 e1l co11difiom1 (2.7) alo1e IDPlies 态 atf va1ia111sy of ipe jo7111 (2.3) a1e co11ye1vedL 


Proof The proof is nearly identical to that of Theorem 2.2. Instead of (2.4) we get 
开 Da = 多 Da 上 +A》 大 克 Da 》 帮 克 DOT 忆 》 50 克 DO 
一 1 JJ=1 7 一 1 
Denoting by (到 , 丈 ) the arguments of 太 三 2 万) and li = 9( 玫 ,2), the same 


trick as in the proof of Theorem 2.2 gives 


王 D = 多 Dao 十 及》 于 7 思 7D 太 十》 帮 于 Do( 帮 ,万 ) 





i1 生 1 
十 凡 号 (2 一 805 一 Daii) 17 DO (2.9) 
2 一 1 


Since (2.5) is an invariant we have Fo 27Dz+oDolyz)=0forallyand >. 
Consequently, the two conditions (2.7) and (2.8) imply yi 2 = 区 忆 20. 

For the Special case where 厂 depends only on xz and 9 only on V, the assumption 
j 帮 oO7Dz+oDo() = 0 (orally 2 implies that /zi7Dz = -Do(y) = 
Co1st. Therefore, condition (2.8) is no longer necesSary for the proof of the state- 
Iment. 
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ITIV.2.3 Nystrom Methods 


An important class of partitioned differential equations is w = 2, 之 = 0(V) or， 
equivalently， 


少 一 9(9)， (2.10) 


Many examples of Chap.I are of this form, in particular the NW-body problem of 
Example 1.3 for which the angular momentum is a quadratic first integral. Nystrom 
methods (Deftinition II.2.3)， 


万 三 g( 加 +cij 加 十 尼 》 ax 的 )， 

ER (2.11) 
%1L 二 go 二 jgo+ 忆 >》 Bi 信 王 加 十 彤 》 Bi 人 

i1 记 1 


are adapted to the numerical solution of (2.10) and it is interesting to investigate 
which methods within this class can conserve quadratic invariants. 


Theorem 2.S. 矿 1pe coe 访 cie1atsy of ipe NMNys1ro1a 1ietpod (2.171) Sat1z 放 


Di = wu 一 ci) 人 


C.12) 
bi 人 (O1 一 ai) = 008 一 ai 加 or 27 一 15， 


态 ezn 广 comserves G11 guUacyatic invariana1s or zjpe jpzz DT 


Proof The quadratic form QU D) = DY is afirst integral of (2.10) 证 and only 
让 
ID1yJ+WDg 几 =0 forall yy en" (2.13) 


This implies that D is skew-symmetric and that 7 9(y) = 0. 

In the Same way as for the proofs of Theorems 2.2 and 2.4 we now com- 
pute 4 万 们 using the formulas of (2.11) and we substitute yo by 疙 一 cijoo 一 
12 》， 1051 人， Where 8i denotes the argument of g in (2.11). This yields 


好 万 页 = 多 万 加 十 大 闻 万 加 二 六》 用 2 万 6 
| 


二 有 >》 记 好 加 十 刀 》 1 一 加 了 6 
三 十 2 一 工 





二 局 》 528 一 oODL 


i7=1 


Using the skew-symmetry of D and Y7DA = 太 丰 9( 克 ) = 0, condition (2.12) 
implies the conservation property 万 页 = 友 万 20. 
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Remark 2.6 (Composition Methods). If a method 2 conserves quadratic invari- 
ants (e.g.,the mid-pointrule by Theorem 2.1 orthe Stormer-Verlet Scheme by Theo- 
rem2.3 oraNystrom method of Theorem 2.5),then So does the composition method 


萝 一 更 ooG (2.14) 


This obvious property is one of the most Important motivations for conslidering com- 
position methods. 


IV.3 Polynomial Invariants 


We consider two classes of problems with polynomial invariants for degree higher 
than two. First, we treat linear problems for which the determinant of the resolvent ls 
an invariant, and we Show that (partitioned) Runge-Kutta methods cannot conserve 
them automatically. Second, we Study lsospectral fows. 


IV.3.1 The Determinant as a First Integral 
We consider quasi-linear problems 
立 = 4(7) 世 YY(0)= 芒 (3.1) 


Where 了 and 4(7Y) are7Xx72 matrices. In the following we denote the trace of a 
matrix 4 三 (aij) 节 jby trace4 = 守 记 1aii. 


Lemma 3.1. 矿 trace 4(7) = 0 for da11 史 tpen g( 了) := dety 六 az ivariant oF 
1je 1IQ111X di1jeretial eguatiom (3.7). 


Proof Ifollows from 
det( 了 +e4Y) =det(T 二 s4)det= (1+stace4 二 O(e2))det 普 


that 9g(7)(4Y7) =trace4.detyY (thisis the4bpe[-Lioxmville-yacop 广 Ostrogradsj 
idemlit). Hence, the determinant g( 了 ) = detY is an invariant of the differential 
equation (3.1) 这 trace 4( 三 ) = 0 for all 壮 . 














Since det Y ITepresents the volume of the parallelepiped generated by the 
columns of the matrix , the conservation of the invariant 9g( 节 ) dety is related 
to Volume preservation. This topic will be further discussed in Sect. VI.9. Here,， we 
consider det y as a polynomial invariant of degree m, and we investigate whether 
Runge-Kutta methods can automatically conserve this invariant for 7 > 3. The key 
lemma for this study ls the following. 
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Lemma 3.2 (Feng Kang & Shang Zai-jiu 199S). Zel 民 (z) pe a djerentiapie 
Janaction dejizzed 0 a 1eighpoxrjpood of z = 0 and assu1ie tpat 已 (0) = 1 and 
尼 '(0) =1. 7jem we pavejor7 之 3 


det R(4) =1 Jora1mx77aatriceys 4satisPing trace4=0， (3.2) 
矿 and omly 矿 展 (z) = exp(2). 


Proof The “if part follows from Lemma 3.1, because for constant 4 the solution 
of 立 = 4Y,Y(0) = Tis givenby Y( = exp(41). 

For the proof of the“only 夺 ”part，we consider diagonal matrices of the form 
4= diag(/z 一 (4 十 z,0,...,0), which have trace4 = 0, and for which 


Rd) = diag (EN RO, RCI)) RD， R(0))， 
The assumptions 民 (0) = 1 and (3.2) imply 
RUOJRUODR(-Ww 二 IJ) 一 1 (G.3) 


for all /yy close to 0. Putting zy = 0, this relation yields 民 (J) 尽 (一 J = 1for all /， 
and therefore (3.3) can be written as 


民 (J 尽 (7) = 尽 (4 十 切 ) for all /, zy close to 0. (3.4) 


This functional equation can only be satisfied by the exponential function. This ls 
Seen as follows: from (3.4) we have 
UL 二 eS) 一 尼 OJ 
E E 
Taking the limit s 一 0 we obtain 忌 '(A) = 尽 ( 由 ,because 民 (0) = 1. This implies 
忌 (U) = exp(A)， 

















Theorem 3.3， For 风 过 3，1oO Ru1a8ge-KUtta 1ietpod ca1 co11Serve QU Poly1zo111ia1 
11VG1iG11S Of degyee 7. 


Proojf Itis sufficient to consider linear problems 立 = 4Y with constant matrix 4 
satisfying trace 4 = 0, so that q( 了 ) = detyY is a polynomial invariant of degree 
7. Applying a Runge-Kutta method to such a differential equation yields 2 王 
忌 (4)Yo, where 
尽 ( 四 =1+z0T 一 207I 

( 红 = (0 = (Dand4 = (ai)isthe matrix of Runge- 
Kutta_ coefficients) ls the So-called stability function. It is seen to be rational. 
By Lemma 3.2 it is therefore not possible that det R(P4) = 1 for al 4 with 
trace4 = 0. 














This negative result motivates the Search for new methods which can conserve 
polynomial invariants (See Sects.IV.4,，[IV.8 and VI.9). We consider here another 
interesting class of problems with polynomial invariants of degree higher than two. 
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TIV.3.2 Isospectral Flows 
Such fows are created by a matrix differential equation 
Z=[B(D) 相 ZXO=z G.5) 


Where Zo is a given Symmetric matrix,， 妃 ( 卫 ) is skew-symmetric for all 元 ,and 
[, 岂 =B0O 一 Pisthe commnutator of 已 and 友 Many interesting problems can 
be written in thlis form. We just mention the TIoda System, the continuous realization 
of QR-type algorithms, projected gradient fows, and inverse elgenvalue problems 
(See Chu (1992) and Calvo, Iserles 包 Zanna (1997) for long lists of references). 


Lemma 3.4 (Lax 1968, Flaschka 1974). Let Zo pe sy1a11etric a11d GSSU1Ie 1 太 at 
妃 ( 苑 ) 8 SKew-sy1a11aetric Jr a11 了 7Tjem ie Solutiom 工作 of(3.9) 1 asSy1a11aetric 
1IQ111X，G11C 118 elge1VGLUes G1e Idepe1de1t of 


Pioof The symmetry of 元 ( 切 follows from the fact that the commnutator of a skew- 
Symmetric with a symmetric matrix gives a Symmetric matriX. 
To prove the isospectrality of the fow, we deftine V7(b by 


蕊 一 ( 世 区) 梭 (0) = 工 (3.6) 


Then，we have (d/d(U-ILU) = UV-I 友 一 BD+TLB)II = 0, and hence 
(IN = To foralzt sothat 工 昌 = DT 力 ZoU 人 -Lis the solution 
of (3.3). This proves the result. 














Note that, Since 巨 (7) is skew-symmetric, the matrix U (加 of (3.6) is orthogonal 
by Theorem 1.0. 

Lemma 3.4 shows that the characteristic polynomial det( 区 一 X7) = > 0 aiX; 
and hence the coefficients wi also are independent of 纪 .These coefficients are all 
polynomial invariants (e.g.,，ao = detZ, an 1 = 十 trace 了 ). Because of Theo- 
rem 3.3 there ls no hope that Runge-Kutta methods applied to (3.3) can conserve 
these invariants automatically for7m 之 3. 





Jsospectral Methods，The proof of Lemma 3.4, however, Suggests an interesting 
approach for the numerical solution of (3.3). For 7 = 0,1,... we solve numerically 


DT= BUDD TU(00) = 工 (3.7) 


and we put Zn+1l 三 这 六 这 工 , where 这 is the numerical approximation 他、 7 ( 岂 ) 
after one step (cf. Calvo, Iserles 多 Zanna 1999). 下 妃 () is skew-symmetric for all 
matrices 卫 , then CTUisa quadratic invariant of (3.7) and the methods of Sect.IV.2 
will produce an orthogonal L/. Consequently, Zn+1 and Zn have exactly the Same 
elgenvalues, and they remain Symmetric. 

Diele, Lopez 儿 Politi (1998) suggest the use of the Cayley transform L 王 
(一 2 玖 (十 玉 ，which transforms (3.7) into 


立 =- -了 J)B(UIAUT)GE+Z， YY(0) = 0， 
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and the orthogonality of L into the Skew-Symmetry of y (See Lemma 8.8 below). 
Since all (also explicit) Runge-Kutta methods preserve the Skew-Symmetry of ， 
which is a linear invariant, this yields an approach to explicit isospectral methods. 


Connection with the QR Algorithm. In a diversion from the main theme of this 
Section, we now Show the relationship of the fow of (3.9) with the QR algorithm for 
the symmetric eigenvalue problem. Starting from a real symmetric matrix 4o, the 
basic COR algoritpm (without Shifts) computes a sequence of orthogonally Similar 
matrices 41, 42, 43,... ,expected to converge towards a diagonal matrix carrying 
the eigenvalues of 40. Iteratively forK = 0,1,2,..., one computes the QR decom- 
position of 4 
4 一 COpR 


with CA orthogonal, 尺 : upper triangular (the decomposition becomes unique ifthe 
diagonal elements of PR are taken positive). Then, 4j+1 8 obtained by reversing 
the order of multiplication: 

41+1 一 REG 


It is an easy exercise to Show that Q(KE) = QoQl ...Qk_iis the matrix in the 
orthogonal Similarity transformation between 40 and 45: 


勾 =Q(OD 4oQ( G.8) 
and the same matrix Q(E) is the orthogonal factor in the QR decomposition of 44: 
46 = Q( 问 尼 ( 月 . (3.9) 


Consider now, for an arbitrary real function 三 defined on the eigenvalues of a real 
Symmetric matrix 上 Lo, the QR decomposition 


exp(t ZLZo)) 三 @ 人 已 (G.10) 


and define 

7 的 := 一 QZog 人 . G.11) 
The relations (3.8) and (3.9) then Show that for integer times 上 三 K, the matrixX 
exp(JZ(O)) =Q@(O7 exp(HF(Zo))Q(OR) coincides with the kth matrix in the QR 
algorithm starting from 4o = exp(J(Zo)): 


exp(JCE( 有 ) = 4 G.12) 


Now, how is all this related to the System (3.3)? Differentiating (3.11) as in the 
proof of Lemma 3.4 shows that 卫 ( 女 solves a differential equation of the form 天 = 
[已 , 刀 with the skew-symmetric matrix 成 = -QI7O. At first Sight, however Bisa 
function of 妃 not of 元 . On the other hand, differentiation of (3.10) yields (omitting 
the argument t where it ls clear from the contextb) 


JIo)QR = JIo)exp(tf(Lo)) = exp(tf(LZo))J(Eo) = CR+QR， 
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and since j 帮 (7 = QLAFCEo)Q by (3.11), this becomes 
H 门 =Q QTRR 


Here the left-hand Side is a symmetric matrix, and the right-hand Side ls the sum of a 
Skew-Symmetric and an Upper triangular matrix. It follows that the Skew-Symmetric 
matrix 己 一 一 QQ is given by 


B(D) = FTD+ 一 7 (3.13) 


Where j 丰 ( 卫 )+ denotes the part of /二 ) above the diagonal. Hence, 也 ( 切 is the solu- 
tion of an autonomous System (3.5) with a skew-Ssymmetric 妃 ( 厂 ). 

For Fz) =2Zandassuming Zo symmetric and tridiagonal, the fow of (3.5) with 
(3.13) is known as the 7Tpda Jow. The QR iterates 40 = exp(Zo), 41, 42,.. .ofthe 
exponential of Zo are seen to be equal to the exponentials of the solution 也 (二 of 
the Toda equations at integer times: 4 = exp(Z( 让 )),adiscovery of Symes (1982). 
An interesting connection of the IJoda equations with a mechanical System will be 
discussed in Sect.X.1.5. 

For F(z) = logz,the above arguments show that the QR iteration itself, starting 
ffom a positive definite symmetric tridiagonal matrix, is the evaluation 4 王 艺 (让 ) 
at integer times of a solution 也 ( 坟 of the differential equation (3.3) with 已 given 
by (3.13). This relationship was explored in a series of papers by Deift Li, Nanda 
贸 Tomei (1983, 1989, 1993). 

Notwithstanding the mathematical beauty of this relationship, it must be re- 
marked that the practical QR algorithm (with shifts and defation) follows a different 
path. 
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Und bist du nicht willig, so brauch ich Gewalt. 
(J.W. Goethe, Der EFLKomzzg) 


Suppose we have an (7 一 几 )-dimensional submanifold of 月 "， 
人 M=1y;9)=0} (4 
(9 :下 ”一 及 7), and a differential equation y = jy) with the property that 
Vo E 人 4 implies zy 人 ( 蕊 EM forallt. (4.2) 


We want to emphasize that this asSsumption ls weaker than the requirement that 
all components 0i(V) of g(y) are invariants in the sense of Definition 1.1. In fact， 
assumption (4.2) is equivalent to g'(V) JU = 0fory Ee AM, whereas Definition 1.1 
iequires g' (FJ =0foraly ee 下 "”. Ithesituation of (4.2) we call g(y) a weak 
ivariazt and we Say thaty = jy) is a differential equation on the manifold A4. 
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Fig. 4.1. The implicit midpoint rule applied to the differential equation (4.3). The Picture 
shows the numerical values for q2 十 q2 obtained with step size 娘 = 0.1 (thick line) and 
丸 王 0.05 (thin line) 


了 xample 4.1.， Consider the pendulum equation written in Cartesian coordinates: 


dl 三 D1， D1 三 一 01 和 X) 


(4.3) 
d2 一 D2， Do 三 一 一 ‰2 入 ， 


where 入 = (2 十 2 一 q2)/(oz 十 0).One can check by differentiation that glp1 十 
d22D2 (orthogonality of the position and velocity vectors) ls an invariant in the Sense 
of Definition 1.1. However, q? 十 2 (length ofthe pendulumy) is only a weak invariant. 
The experiment of Fig.4.1 shows that even methods which conserve quadratic first 
integrals (cf. Sect.IV.2) do not conserve the quadratic weak invariant qz 十 02. No 
numerical method that is allowed to evaluate the vector field jy) outside 人 can 
be expected to conserve weak invariants exactly. This is one of the motivations for 
considering the methods of this and the Subsequent Sections. 


Anatural approach to the numerical solution of differential equations on mani- 
folds is by projection (See e.g., Hairer 久 Wanner (1996), Sect. VI.2, Eich-Soellner 
久 Fuhrer (1998), Sect. 5.3.3). 


Algorithm 4.2 (Standard Projection Method). 4ssxzzre tat yw E 人 1. Ope step 
on hy opn+ltsdej1ed asjJollows (See 8 了 2) 


e@ Co1iDtite 人 +1 三 ( 加 )， Were 呈 有 3 01 Ga1Dita1y o1le-step 1ietjpod applied 加 


= 帮 g); 


e@ Dioject ipe valtue Wi o1tto 1pe 1pa1 训 MA io obiaz yn E 人 41. 


人 4 





Fig. 4.2. Ilustration of the standard projection method 


For yn E 人 the distance of +l1 to the manifold A4 is of the size of the local 
error ie., O(hP+1). Therefore, the projection does not deteriorate the convergence 
order of the method. 
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For the computation of yn +1 we have to solve the constrained minimization 
problem 


oo 一 到 一 min Subject to 9g(yn+1) = 0. (4.4) 


In the case ofthe Euclidean norm, a standard approach is to introduce Lagrange mul- 
tipliers 入 = (Xi .，,Xm)7, and to consider the Lagrange function CUyn Hi, 和 ) 一 
1 一 芒 H27/2 -9 人 (7 和. The necessary condition 9C/y+l = 0 then 
leads to the System 





Znp+L 一 孜 +1 十 多 ( 苞 1) 和 
0 三 gmn+1) 


We have replaced yn+1l with 加 + in the argument of g(V) in order to save some 
evaluations of g'(y). Inserting the first relation of (4.5) into the second gives a non- 
linear equation for 入 which can be efficiently solved by Simpliftied Newton itera- 
tions: 


= 
Xi 一 一 (9 和 Hg 和 HT) (天 Hg 的 DTAi， Ai+1l 二 Xi 十 AN. 


Forthe choice Xo = 0the firstincrement ANXo is of size O(PP+1), so that the conver- 
gence is usually extremely fast. Often, one Simplified Newton iteration ls Sufficient. 


(4.5) 





了 xample 4.3. As a first example we consider the 
perturbed 天 epler problem (see Exercise [I.12) with 
了 amiltonian function 


1 1 
刀 (P， 一 
( 9) 2 (21 2 ) V 字 十 吧 
0.005 
2V( 二 3 
and initial values dl1(0) = 1 一 e, az(0) = 0， 
D1(0) = 0, pz(0) = V(1L +e)/L 一 ej (eccentric- 
itye 王 0.6) on the interval 0 乞 土 冬 200. The exact 
solution (plotted to the righb is approximately an ellipse that rotates Slowly around 
one of its foci. For this problem we know two first integrals: the Hamiltonian func- 
tion 瑟 (p, q) and the angular momentum 了 (P, dg) = q1Da2 一 q2D1. 

We apply the explicit Euler method and the Symplectic Euler method (I.1.9)， 
both with constant Step Size 几 = 0.03. The result is shown in Fig.4.3. The nu- 
merical solution of the explicit Euler method (without projection) ls completely 
WwWIiong. The projection onto the manifold { 万 (pz,9) = 互 (po,qo)} improves the nu- 
merical solution, but it still has a wrong qualitative behaviour. Only projection onto 
both invariants, 万 (p,9) = Comst and 卫 (D,9) = Comst gives the correct behav- 
iour. The Symplectic Euler method already Shows the correct behaviour without 
any projections (See Chap.IX for an explanation). Surprisingly, a projection onto 
刀 (p,dq) = Comst destroys this behaviour the numerical solution approaches the 
centre and the Simplified Newton iterations fail to converge beyond 上 一 25.23. Pro- 
jection onto both invariants re-establishes the correct behaviour. 


[ 
exact Solution 








0 











112 


explicit Euler, 刀 三 0.03 
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Fig. 4.4. Explicit Euler method with projections applied to the outer solar system, Step Size 
玉 三 10 (days), interval 0 区 过 200 000 


上 xample 4.4 (Outer Solar System). Having encountered excellent experience 
WwWith projections onto 妃 and 也 for the perturbed 天 epler problem (Example 4.3)， 
let us apply the Same idea to a more realistic problem in celestial mechanics. We 
consider the outer Solar System as described in Sect.I.2. The numerical solution 


of the explicit Euler method applied with constant step Size 尹 = 10, once with 
projection onto 克 = _ Const and once with projection onto = Const and 
卫 = Comst, is Shown in Fig.4.4 (observe that the conservation of the angular 


momentum 厂 (D, 9) 


> di X Di Consists of three first integrals). We See a Slight 


improvement in the orbits of Jupiter 9aturn and Uranus (Compared to the explicit 
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Euler method without projections, see Fig.I2.4), but the orblt of Neptune becomes 
even Worse. There is no doubt that this problem contains a Structure which cannot 
be correctly simulated by methods that only preserve the total energy 巨 and the 
angular momentum 也 . 


上 xample 4.S (Volume Preservation). Consider the matrix differential equation 
六 = 4(Z)Y, where trace 4(Y) = 0 for all Y. We know from Lemma 3.1 that 
9(7Y) = detyY is an invariant which cannot be automatically conserved by Runge-- 
Kutta methods. Here, we Show how we can enforce this invariant by projection. Let 
Yn+l be the numerical approximation obtained with an arbitrary one-step method. 


We consider the Frobenius norm | = /20i2 formeasuring the distance 


to the manifold { 科 ; g() = 0}. Using g%(Y)(4Y) = trace4 detY (see the proof 
of Lemma 3.1) with 4 chosen such that the product 4Y contains only one non-zero 
element, the projection step (4.3) is Seen to become (Exercise 9) 


2 = 作 十 HT (4.60) 


with the Scalar /人 一 入 det 到 411. This leads to the scalar nonlinear equation 
det ( 攻 +1 十 HZ) 一 det 岂 , for which Simplified Newton iterations become 


det( 区 + 十 让 (1 十 (UL 一 Ji) trace(( 冯 2 攻 +-) ) 一 det 世 . 


If the CR-decomposition of 这 麻 is available from the computation of det 了 
the value of trace((ZT 1 卫 +)-9) can be computed efficiently with O(n”3/3) flops 
(See e.g., Golub 信 Van Loan (1989), 9ect. 5.3.9)， 

The above projection is preferable to 了 允 +1 三 c 思 1 where c E 慌 is chosen 
Such that det +1 三 det 区. This latter projection is already i-conditioned for 
diagonal matrices with entries that differ by Several magnitudes. 


As a conclusion to the above numerical experiments we See that a projection 
can give excellent results, but can also destroy the good long-time behaviour of the 
solution iapplied inappropriately. I the original method already preserves Some 
Structure, then projection to a Subset of invariants may destroy the good long-time 
behaviour An Important modification for reversible differential equations (Symmet- 
Tic projections) will be Presented in Sect. V.4.1. 


TIVS Numerical Methods Based on Local Coordinates 


A second important class of methods for the numerical treatment of differential 
equations on manifolds uses local coordinates. Before explaining the ideas, we find 
it appropriate to discuss in more detail manifolds and differential equations on man- 
ifolds. 
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IV.S.1 Manifolds and the Tangent Space 


In Sect.IV.4 we assumed that locally (in a neighbourhood Z of co < 陈 "”) a manifold 
ls given by constraints, 1.e.， 


人 =EUi9)=0)， (G.D) 


whereg :U 一 及 mis differentiable, g(a) = 0, and qg(a) has full rank 77. 

Here, we use local parameters to characterize a manifold. Let 钞 :一 开 "”be 
differentiable (Y C 月” mi a neighbourhood of 0), V(0) = a, and assume that 
凡 (0) has full rank 7m 一 72. Then, a manifold is locally given by 


人 = 了 = 2ET) (5.2) 


provided that 了 is sufficiently small, so that 见 : Y 一 W(P) is bijective with 
continuous inverse. The variables > are called Paya1aeters QT /ocal coo1dijzatesy of 
the manifold. 

As an example, consider the unit Sphere which, in the form (3.1), is given by the 
function g(yi,y2,y3) = 好 十 吧 十 好 一 1.There are many possible choices of local 
coordinates. Away from the equator (il.e., ya 一 0)， We can take 2z 一 (2 2z2) 工 





(oo) and wW(2) 一 (2 ， 22; 士 V1L 一 这 一 地 ) Alternatively，we can consider 


Spherical coordinates W(a,D) 三 (cos asin CO,sin a sin O, coSs 有 7 away from the 
north and South poles (ie., yl = ya2 三 0, ya 三 士 1). 

The tangent to a curve (or the tangent plane to a Surface) is an affine Space 
passing through the contact point w E 人 A4. It is convenient to place the origin at aw， 
So that we obtain a vector Space. More precisely, for a manifold 人 4 we define the 
1a118e11 SDace at Q E 人 4 as 





砂 AM 人 c Ra there exists a differentiable path 7 : (一 人 汪 一 了 豚 ” 05.3) 


with y( 坟 < Afforallt 7y(0) = a 3(0) = 





Lemma S$.1. 矿 1e 71a11ol AN Siven py (3.1 whereg:U 一 下 ”1dizeren- 
1iaple, g(a) = 0 a1d g(a) Pas jl1l ya 7 te we jave 


有 AM=kerg(ol ={ER 了 "19(auv = 0 (3.4) 


矿 .AN Sien py (9.2) wjperez: 太一 有 "iderentiaplie (0) = @ aad (0) 
Pas Joll 7a1K 风 一 7 加 e1i We jave 


有 AM = ImnwW(0) = {W (Ole 开光 上 (5.5) 


Proof a) For apath 7y(b satisfying 7(0) = wa and g(7y 人 的 ) = 0itfollows by differ- 
entiation that g'(a)y(0) = 0. Consequently, we have TuAM C kerg'(a). 
Considernow the function 严 ( 态 ) = g(a 十 妃 十 9 (ao Wehave 忆 (0,0) =0 
and an invertible OFV/6u(0,0) = 9g(ajg'(a)7 so that by the implicit function the- 
orem the relation 天 ( 坊 风 ) 三 0 can be solved locally for 三 民 (区 . 玉 veE kerg (ao)， 
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it follows that 飞 (0) = 0, and the path 7 的 = na 十 妃 十 g (av 人 bsatisfies all 
redquirements of ($.3), so that also TuA4 2D ker g'(a). 

b) Assume 人 to be given by (5$.2). For an arbitrary 7 : (一 s, E) 一 豚 岂 satisfying 
1(0) = 0, the path 7( 信 瑟 (7() lies in AM and satisfies 3 了 (0) = 几 (0)7(0). This 
proves Imw'(0) C TaA4. 

The assumption on the rank of (0) implies that，after a reordering of the 
components，we have W(z) = (Wi(2),Wa()7, where Wi(z) is alocal diffeomor- 
phism (by the inverse function theorem). We show that every Smooth path 7( 娘 记 
AM can be written as 7y( = 坊 (7( 鸭 ) with some smooth 7( 坟 . This then implies 
TaA C ImwW'(0). To prove this we split 7 信 = (7 为 ,72( 坟 )7 according to the 
partitioning of W, and we define 7 人 = 困 T(7i( 鸭 ).Sincefory(b < AMthe second 
part ?2(b is uniquely determined by 7i(b, this proves 7Y(t) = (7( 鸭 ). 














The proof of the preceding lemma Shows 
the equivalence of the representations (3.1) and 
(S$.2) of manifolds in 陈 "”. Let 人 4 be given by 


(9.1), and assume that the columns of Q form Q Ci 

an orthogonal basis of TaA4. As in part (a) of 

the proof of Lemma 5.1 the condition g(a 十 9 (aTuw 
Qz+g9g'(a)7vu) = 0 defines locally (close to 全 ( 节 


2 一 0) afunction v(z) which satisfies w(0) 王 
0 and w(0) = 0. Hence, the manifold A4 is 
also given by ($.2) with the function W 光 (2z) = 十 Qz 十 9 (a)7u(2)， 

On the other hand, let A4 be given by (S$.2). Part (b) of the proof of Lemma 5.1 
Shows that y = W(z) can be partitioned into yl = Wi(z) and ya = Wo2(2)，where 
Wi is a local diffeomorphism. Consequently, 人 is also given by (35.1) with 9(y) = 
妈 一 妇 ( 困 (0))， 


IV.S.2 Differential 了 Equations on Manifolds 


In Sect.IV.4 we introduced differential equations on a manifold as problems satis- 
fying (4.2). With the help of Lemma 5.1 we are now in a position to characterize 
Such problems without knowledge of the solutions. 


Theorem S.2. ZLef AH pe a SUBD1zaa1t1d of 了 "7Tjpe proplieyy = 0) NadQNeren- 
1ial eguatiom o1 加 e 11a11 加 1 ALe isSatisjiesy (2 矿 a1ad omly 矿 


jy) ET Joral1y E 人 4. (3.60) 


Poof The necessity of (5.6) follows from the definition of 7yA4, because the exact 
solution of the differential equation lies in A4 and has jy) as derivative. 

To prove the sufficiency, we assume (3.6) and let 人 AM be locally, near yo, be given 
by a parametrization y = W(z) as in (5$.2). We try to write the solution of y = 站 )， 
V(0) = yo 三 W(20) as yy 的 = Up(z 才 ). 开 this is at al possible, then z(t must 
Satisfy 
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风 (2)2 一 了 (%(2)) 


which, by assumption (3.6) and the second part of Lemma 5.1, is equivalent to 


冯 一 风 (2 Fa)， (5.7) 


where 4+ = (47 4)-147 denotes the pseudo-inverse of a matrix with full column 
Tank. Conversely, define z( as the solution of (5.7) with z(0) = >0, which is known 
to _ exist locally in 上 by the standard existence and uniqueness theory of ordinary 
differential equations on 月 耿 . Then y( 坟 = 未 (z( 坟 ) is the solution ofy = /with 
V(0) = yo. Hence, the solution y( 人 (tb remains in 人 4. 














We remark that the sufficiency proof of Theorem 5.2 only redquires the function 
jyV) to be defined on 人 4. Due to the equivalence of y = jy) with (5.7) the prob- 
lem is transported to the Space of local coordinates. The standard local theory for 
ordinary differential equations on an Euclidean Space (existence and uniqueness of 
Solutions, ...) can thus be extended in a straightforward way to differential equa- 
tions on manifolds, ie., 7 = JU) with 三 : A4 一 及 ”satisfying (5.60). 


IV.S.3 Numerical Integrators on Manifolds 


Whereas the projection methods of Sect. IV.4 require the function jy) ofthe differ- 
ential equation to be defined in a neighbourhood of 人 A4 (see Fig.4.2), the numerical 
methods of this section evaluate jy) only on the manifold A4. The idea is to apply 
the numerical integrator in the parameter Space rather than in the Space where AM is 
embedded. 


Algorithm S$.3 (Local Coordinates Approach). 4ssxrze tat yn E 人 1 ad 1nat 示 
ia Vocal Pa1a1aetrization of A Satighinag W( 加 ) 三 加 One step 加 记 UnH1L TS 
dejijzed as Jol1iows (See Fi8.3.1) 


e@ Co1iDtie 轩 1 三 Gp( 加) te yesut ortjpe 11etpod Gh applied io (3.1 
e@ dejijze 妨 e 1a1aerical solution py Vn+l 三 翁 ( 2 十 1). 


玉 DIDPorta1it to 7e11101 天 态 at te pa1a1aetrizationy 一 Up(z) ca1a pe cjpa1aged ateve1y 
S1eD. 





Fig. S$.1. The numerical solution of differential equations on manifolds via local coordinates 
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As indicated at the beginning of Sect.IV.3.1, there are many possible choices 
of local coordinates. Consider the pendulum equation of Example 4.1, where A4 = 三 
{(@q2, pl pz)| 十 0 一 1 dp1l 十 dp2 一 0}.Astandard parametrization here 
is gl 一 Sin Qa, ga 三 一 cog Q,D1 一 wcosa,andDpo 一 wSsina.JInthenew coordinates 
(a,w) the problem becomes Simply G = w,w = 一 sin a. Other typical choices are 
the exponential map W(2) = exp(2) for differential equations on Lie groups, and 
the Cayley transform W(2) = (TI 一 ZJ-1+2) forquadratic Lie groups. This will 
be studied in more detail in Sect.IV.8 below. Here we discuss two commonly used 
choices which do not use a Special structure of the manifold. 


Generalized Coordinate Partitioning. We assume that the manifold is given by 
(3.1). 斑 9 : 展 ” 一 及 ”has a Jacobian with full rank 7 at y = w, we can find a par- 
titioning y = (Wi,yo), such that 99/amo(a) is invertible. mn this case we can choose 
the components of yi as local coordinates. The function y = 内 (2z) is then given by 
贡 = :and = 妨 (2 where Wo(z) is implicitly defined by g(z,W%a(2)) = 0. 
This approach has been promoted by Wehage 儿 Haug (1982) in the context of con- 
strained mechanical Systems, and the partitioning is found by Gaussian elimination 
with full pivoting applied to the matrix 9g'(a). Another way of fnding the partition- 
ing ls by the use of the QR decomposition with column change. 


Tangent Space Parametrization. Let the manifold A4 be given by (3.1)，and 
collect the vectors of an orthogonal basis of TuM in the matrix C. We then consider 
the parametrization 

wa(2) 一 aa 十 Qz 二 9g (au 人 2)， (3.8) 


where vw(z) is defined by g(Wa(2) = 0, exactly as in the discussion after the proof 
of Lemma 5.1. Differentiating (3.8) yields 


(Q+9(O VD)2=D= 7 = 帮 we) 


Since QIQ = Tand g(ajQ = 0, this relation is equivalent to the differential 
equation 


一 @ (pa(z))， G.9) 


which corresponds to (5.7). 下 we apply a numerical method to (5.9), every function 
evaluation requires the projection of an element of the tangent Space onto the mani- 
fold. This procedure is illustrated in Fig. 3.1, and was originally proposed by Potra 多 
Rheinboldt (1991) for the solution of the Euler-Lagrange equations of constrained 
multibody Systems (See also Hairer & Wanner (19960), p.470). 
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IV.6 Differential 上 Equations on Lie Croups 


Theorem 1.6 and Lemma 3.1 are particu- 
lar cases of a more general result which can 
be conveniently formulated with the concept 
of Lie groups and Lie algebras (See Olver 
(1986) and Varadarajan (1974) for an intro- 
duction to these Subjects). 

ALie group fa group CG which is a dif- 
ferentiable manifold, and for which the prod- 
uct is a differentiable mapping C x CG 一 G. 
We restrict OUT considerations to 712Q11rix Zie 
8S1oUDs, that 1S$，Lie groups which are Sub- 
groups of GL(m)，the group of invertible 
7 X7matrices with the usual matrix prod- 
uct as the group operation. 





了 xample 6.1. An important example of a . 
Lie group is the group Marius Sophus Lie 


oO ={7esGL1Y7r = 全 


of all orthogonal matrices. It is the zero set of g( 了 ) = Y7 和 一 六 where we consider 
9 as a mapping from the Set of al 7 x 7 matrices (ie., 及””) to the set of all 
symmetric matrices (which can be identified with 月 "("+D72). The derivative 9 (三 ) 
is surjective for YY E O(m), because for any symmetric matrix 玫 the choice 互 = 
Y 开 /2 solves the equation 9 ( 妆 ) 克 = 天 .Therefore, the matrix 9 ( 六 ) has full rank 
(cf. (5.1)) so that O(m) defines a differentiable manifold of dimension m2 一 m(m 十 
1)/2 王 双 (人 一 1)/2.The setO(m) is also a group with unit element 7 (the identity). 
Since the matrix multiplication is a differentiable mapping, O(m) is a Lie group. 


Table 6.1 lists further prominent examples. The matrix ./ appearing in the defi- 
nition of the Symplectic group is the matrix determining the Symplectic Structure on 
及 ”(see Sect. VI.2). 

As the following lemma shows, the tangent Space g = T7G at the identity 7 of 
a matrix Lie group CG is closed under forming commnutators of its elements. This 
makes g an algebra, the Zie algepra of the Lie group C@7. 


Lemma 06.2 (Lie Bracket and Lie Algebra). ef G pe Ga7Patrix Zie 81oxD a1d 1et 
g 三 T7G pe ie 1a11ge11 spDace af tjpe identity 7T1e ZLie prackef(or co111U1a1o 丰 


[4,B=4B-B4 (6.1) 


dejizzes dz ope1afion gx 日 一 gwjicpn iDiinear Semw-synzetric (4 也 | = 
一 [ 妃 ,4) and satisjies te Jacopi identity 


L4,[B,C] + [C,L4 相 +[B,IC,4] = 0 (6.2) 


3 Marius Sophus Lie, born: 17 December 1842 in Nordfjordeid (Norway), died: 18 February 
1899. 
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Table 6.1. Some matrix Lie groups and their corresponding Lie algebras 



































Lie group Lie algebra 
GLm) = {7|det 并 和 0} g[n) = {4larbitrary matrix} 
general linear group Lie algebra of 7 xm matrices 
SLp) = {|det=1} 5[(n) = {4|trace(4) =0} 
Special linear group Special linear Lie algebra 
oO) = {71Y77= 娘 5o(m) = {4147+4=0} 
orthogonal group Skew-Symmetric matrices 
SO) = {YeoOm)|ldet 和 三 1 5so(m) = {4147+4=0} 
Special orthogonal group Skew-Symmetric matrices 
Spm) = {ZlyY7yYr= 必 sb) = {4|7J4+477J=0} 
Symplectic group 








Proof By definition of the tangent space, for 4,B Ee g, there exist differentiable 
paths a( 坟 ,GO ( 直 < sinGsuchthat a 人 =T 寺 1 的 with acontinuous function 
4 with 4(0) = 4,and similarly Cg 王 T 十 雪 扫 with 刀 (0) = .Now consider 
the path y( 轨 in G defined by 


7 的 = a(VDB(VBDa(VD-16(VD-L，t>0 
An elementary compnutation then yields 
yy 的 = 工 + 上 [4,B] 上 +o 人 0 
With the extension y( 介 = (一 力 -1 for negative 蕊 this is a differentiable path in 
G satisfying 7Y(0) = 了 Tand3y(0) = [4,B. Hence [4, Be gby defiinition of the 


tangent Space. The properties of the Lie bracket can be verified in a Straightforward 
Way. 














了 Example 6.3. Consider again the orthogonal group O(?). Since the derivative of 
0(O) =Y77 一 Tattheidentity is 9g( 丰 媚 = 开刀 二 瓦 IT7T= 玖 十 五 ,itfollows 
from the first part of Lemma 5.1 that the Lie algebra corresponding to O(m) consists 
of all skew-symmetric matrices. The right column of Iable 6.1 gives the Lie algebras 
of the other Lie groups listed there. 


The following basic lemma shows that the exponential map yields a local para- 
metrization of the Lie group near the identity, with the Lie algebra (a linear Space) 
as the parameter Space. 
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Lemma 6.4 (Exponential Map). Cozsider a 1Iat1rix ZLie 81oUD G a1d it Lie alge- 
BD1a g. 7Tjpe 1IQ11iX exDpO1e11iQL 13 Q 1110D 


exp :8 一 @， 


ie Jor4eEgwepaveexp(4) EG. Moreover exp 1a1ocal dieomorpjism za 
1eigjppoxrpood of 4 = 0. 


Proof For 4eg,itfolows from the definition ofthe tangent spaceg 三 T7G that 
there exists a differentiable path w( 坟 in G satisfying aw(0) = 了 and c(0) = 4. For 
afxed 和 E G, the path 7y 扩 := a( 罗 isin Gand satisfies 7y(0) = 了 andy(0) 三 
4Y. Consequently 4Y ce TyGandYy = 4Y defines a differential equation on the 
manifold C. The solution 了 ( 信 =exp(t4) is therefore in G for all 蕊 

Since exp( 五 ) - exp(0) = 互 +O( 古 2?), the derivative of the exponential map 
at 4 三 0istheidentity, and it follows from the inverse function theorem that exp is 
alocal diffeomorphism closeto 4 = 0. 














The proof of Lemma 6.4 Shows that for a matrix Lie group Cr the tangent Space 
aty E Chas the form 
TyG={4Y7|14eg (0.3) 


By Theorem 3.2, differential equations on a matrix Lie group (considered as a man- 
ifold) can therefore be written as 


立 = 4(Y)Y (6.4) 


where 4( 记 JEgforaly es G.Thefolowing theorem summarizes this discussion， 
and extends the statements of Theorem 1.6 and Lemma 3.1 to more general matriX 
Lie groups. 


Theorem 6.S. Zer G pe a 7172atrixZLie gfromp adg 必 Ziealsepbra. 1 47) Eggjor 
a1YEGandtyYoeEG, te tpesyotution ofrf6. 了 和 satisjiegsy(t 切 EGJoralt 














Hin addition 4(Y) e gfor all matrices Y, and 让 
G={Y|19(7) = Const 


is one of the Lie groups of Table 6.1, then g(Y) is an invariant of the differential 
equation (0.4) in the sense of Definition 1.1. 
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IV.7 Methods Based on the Magnus Series 上 Expansion 


Before we discuss the numerical Solution of 
differential equations (6.4) on Lie groups, let 
us give an explicit formula for the solution of 
linear matrix differential equations 


立 = 4( 闷 (7.1) 


No assumption on the matrix 4( 为 is made 
for the moment (apart from continuous de- 
pendence on 加 . For the scalar case, the Solu- 
tion of (7.1) with Y(0) = Yo is given by 


Y( = exp ( 下 4(7) dr 巧 ，。 0792) 


Also in the case where the matrices 4(t) and 
万 4(7) dr commnute, (7.2) is the solution of 
(7.1). In the general non-commnutative case 
we follow the approach of Magnus (1954) and we search for a matrix function 92 (加 
Such that 





Wilhelm Magnus4 


匠 =exp( 人 (2 人) 苞 


Solves (7.1). The main ingredient for the solution will be the inverse of the derivative 
of the matrix exponential. It has been studied in Sect. JIU.4, Lemma IIU.4.2, and is 
given by 


dexpP( 百 = 》 一 adp( 万 )， (7.3) 


where Bk are the Bernoulli numbers, and ad p(4) = [024 =I4--400isthe 
adjoint operator introduced in (HILL.4.1). 


Theorem 7.1 (Magnus 19S4). 71Pe solxtiomn ofipe dijerential equation (7.1) cam pe 
Writtenas 艺人 一 exp(C 的 ) 部 wii Pb dejiped 区 

2=dexpp(40)， 2(00) =0. (7.4) 
hs 1onrg as | 2 的 | < 和 tpe comvergence oftje dexpPi erpa1nasiom (7.3) 58 assured 
Proof Comparing the derivative of (人 = exp(C2 人 的) 了 50， 


YO= (所 ep20)20 克 = (aepoo(20)) ep(20) 


with (7.1D) we obtain 4 人 的 三 dexppd (2(). Applying the inverse operator 
dexpPl to this relation yields the differential equation (7.4) for 12(. The state- 
Iment on the convergence is a consequence of Lemma II.4.2. 














4 Wilhelm Magnus, born: 5 February 1907 in Berlin (Germany), died: 15 October 1990. 
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The first few Bernoulli numbers are Bo = 1 Bi = -1/2, B。 = 1/6, Bs = 0. 
The differential equation (7.4) therefore becomes 


2= .40 一 3 [2,4(] + 四 [2 [2,4(9]| 人 


which is nonlinear in 4Y2. Applying Picard fxed point iteration after integration yields 


PUW = /amDw 汪 [4owao d7 
十 上/ | 上/ 上 上/ 400du 4 da d 困 07.5) 


可 14om /400waam]|lwr 


Which is the So-called Magmaus expa1sio1. For Smooth matrices 4(b the remain- 
der in (7.$) is of Size O( 友 ) so that the truncated series inserted into Y(b 一 
exp( 2 人)2 gives an excellent approximation to the solution of (7.1) for Small 





Numerical Methods Based on the Magnus 上 xpansion。 Iserles 贸 Ngrsett (1999) 
Study the general form of the Magnus expansion (7.3), and they relate the iterated 
integrals and the rational coefficients in (7.3) to binary trees. For a numerical inte- 
gration of 
立 = 4 作 页  Y(o= 巧 (7.6) 

(where yY is a matrix or a vector) they propose using 了 +1 = exp( 几 fp) 到 ,where 
几 f2n is a suitable approximation of 12( 刀 ) given by (7.5) with 4( 刀 十 T) instead of 
4(7T). Of course, the Magnus expansion has to be truncated and the integrals have 
to be approximated by numerical quadrature. 

We follow here the collocation approach Suggested by Zanna (1999). The idea 
is to replace 4( 为 locally by an interpolation polynomial 


and to solveY = 4(0) 普 on [ 刀 , 刀 十 川 bytheuseofthe truncated series (7.5). 


Theorem 7.2. Comsider da guUad1atu1e jatla (ici)2 1 oforderDp > s aad jet 
Y( and 2G0b0 pesotutions of = 4(D7 and2 = 4(2 7espectively satishing 
z(t) 三 Gt). Te 2G(t 十 阅 一 Y( 二 十 加 =O(PP+L). 

Proof We write the differential equation for 2 as 性 = 4(2 十 (4 一 4(0))2G 
and use the variation of constants formula to get 


刀 十 户 四 
2 十 站 一 了 (二 十 内 = 站 Rn 十 刀 D( ar) - 4(r))2(r)dr 
tm 
Applying our quadrature formula to this integral gives zero as result, and the re- 
mainder is of size O(PP+1L). Details of the proof are as for Theorem IIL.1.5. 
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了 Example 7.3. As a first example, we use the midpoint rule (cl = 1/2, 01 = 1). P 
this case the interpolation polynomial is constant, and the method becomes 


艺 本 三 exp(74(t 平 P/3)) 友 ， (7.7) 


which is of order 2. 





了 Example 7.4. The two-stage Gauss quadrature is given by cl = 1/2 土 V3/ 6， 
012 一 1/2. The interpolation polynomial is of degree one and we have to apply 
(7.5) in order to get an approximation x+1. Since we are interested in a fourth 
order approximation, we can neglect the remainder term (indicated by ... in (7.3)). 
Computing analytically the iterated integrals over products of li (加 we obtain 


及 3 瑟 ” 
Yn+1i 三 cp(34 十 42) 十 “bog】 了 mn， 


(7.8) 





where 41 = 4( 志 十 cijp) and 4 = 4( 志 十 czj 由 .This is a method of order four. 
The terms of (7.5) with triple integrals give O(]j4) expressions, whose leading term 
vanishes by the Symmetry of the method (Exercise V.7). Therefore, they need not 
be considered. 


Theorem 7.2 allows us to obtain methods of arbitrarily high order. A Straightfor- 
warduse of the expanslion (7.3) yields an expression with alarge number of commu- 
tators. Munthe-Kaas 儿 Owren (1999) and Blanes, Casas 作 Ros (2000a) construct 
higher order methods with a reduced number of commnutators. For example, for or- 
der 6 the required number of commnutators is reduced from 7 to 4. 

Let us Temark that all numerical methods of this section are of the form 
了 21 = exp(RO2) 了 区， Where fn is a linear combination of 4( 妃 十 ciz) and of 
their commutators. 4( 轨 Eggforallt thenalsoPf2n liesin the Lie algebra g, so 
that the numerical solution stays in the Lie group CiLYo E CG (this is aconsequence 
of Lemma 0.4). 


IV.S Lie Group Methods 


Consider a differential equation 
立 =4(7)Z  Y(0)= 巧 (8.1) 


on a matrix Lie group G. This means that Yo E G and that 4(Y) E gfor al 
ECG.Sincethis is aspecial case of differential equations on amanifold, projection 
methods (Sect.IV.4) as well as methods based on local coordinates (Sect.IV.5) are 
well suited for their numerical treatment. Here we present further approaches which 
also yield approximations that lie on the manifold. 

All numerical methods of this section can be extended in a straightforward way 
to non-autonomous problems Y = 4(tYZJY with 4(tY) egforalltandall 
Y E G.Justto simplify the notation we restrict ourselves to the formulation (8.1). 
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IV.8.1 Crouch-Grossman Methods 


The discipline of Lie-group methods owes a great deal to the pioneering 
work of Peter Crouch and his co-workers . . . 
(A. Iserles, H.Z. Munthe-Kaas, 3S.P Ngwrsett & A. Zanna 2000) 


The numerical approximation of explicit Runge-Kutta methods is obtained by a 
composition of the following two basic operations: (人 an evaluation of the vector 
field /7) = 4( 了 和 and (ii acomputation of an update of the form 和 十 Pa 
For example, the left method of (U.1.3) consists of the following steps: evaluate 
天 1 = /Yo); compnute 到 = 2 十 几 Ki;evaluate 天 2 一 用 六 ); compute 史 /2 一 
Yo 十 分 有 1; compnute 好 三 到 /2 十 公开 2. 

In the context of differential equations on Lie groups, these methods have the 
disadvantage that even when 和 ECGand2 eaG,theupdate 节 十 ja4(G)2 is im 
general not in the Lie group. The idea of Crouch & Grossman (1993) is to replace 
the“update”operation with exp(ja4(2)). 


Definition 8.1. Let waij (07 三 1 .5) be real numbers. An explicit 5s-s1aSe 
C1oucjHn-G1oss11011 11etpod is given by 


YY exp(Paii1Ki-1) exp(hail 开 1) 7 Fi = 4(Y(0))， 
IT1 三 exp(PhDs 开 。) .exp(PDOIKT) 芒 . 





For example, the method of Runge described above (s 一 2, a21 一 1,01 三 2 
1/2) leads to 





史 H = exp(2K2] exp( 2) 区 ， (8.2) 


Where K1 = 4( 芭 ) and 天。 王 4(exp(PFKT) 芳 )， 

By construction, the methods of Crouch-Grossman give rise to approximations 
Yn which lie exactly on the manifold defined by the Lie group. But what can be Said 
about their order of accuracy? 


Theorem 8.2. Zetci >， 了 Qi 4 CorcjHn-G1oss11a1 11etpod pasy o1derDpD 芭 3) 矿 
1je Jo1iowing order co11dilio118s Ga1e SG1isJjied- 


oO1der 1: > 一 1 (8.3) 
o1der 2 : > ;poici 一 1/2 (8.4) 
o1der3: > ,5cz 一 1/3 (8.5) 
> biaijcj 一 1/6 (8.0) 
2 0ci 十 20ci0o = 一 1/3. (8.7) 


Proojf As in the case of Runge-Kutta methods, the order conditions can be found 
by comparing the Taylor series expanslions of the exact and the numerical solution. 
In addition to the conditions stated in the theorem, this leads to relations Such as 
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2c+2y oo 一 了 (8.8) 


2< 了 7 


Adding this equation to (8.7) we find 2 > bicibj 一 1, Which is satisfied by (8.3) 
and (8.4). Hence, the relation (8.8) is already a consequence of the conditions stated 
in the theorem. 














Table 8.1. Crouch-Grossman methods of order 3 








0 0 
-1/24 | -1/24 3/4 3/4 
17/24 | 161/24 -6 17/24 | 119/216 17/108 
| 1 -2/3 2/3 | 13/51 -2/3 24/17 


Crouch 儿 Grossman (1993) present Several Solutions of the System (8.3)--(8.7)， 
one of which is given in the left array of Table 8.1. The construction of higher order 


Crouch-Grossman methods is very complicated ( ... any attempt to analyze algo- 
rithms of order greater than three will be very complex, . . , Crouch 久 Grossman， 
1993). 


The theory of order conditions for Runge-Kutta methods (Sect. II.1) has been 
extended to Crouch-Grossman methods by Owren 儿 Marthinsen (1999). It turns out 
that the order conditions for classical Runge-Kutta methods form a Subset of those 
for Crouch-Grossman methods. The first new condition ls (8.7). For a method of 
order 4, thirteen conditions (including those of Theorem 8.2) have to be satisfied. 
Solving these equations, Owren 贸 Marthinsen (1999) construct a 4th order method 
with s 一 5 Stages. 


IV.8.2 Munthe-Kaas Methods 


These methods were developed in a Series of papers by Munthe-Kaas (1993, 1998， 
1999). The main motivation behind the first of these papers was to develop a the- 
ory of Runge-Kutta methods in a coordinate-free framework. After attempts that 
led to new order conditions (as for the Crouch-Grossman methods), Munthe-Kaas 
(1999) had the idea to write the solution asY( 塌 = exp(2()Zo and to solve 
numerically the differential equation for 12( 坟 . I sounds awkward to replace the 
differential equation (8.1) by a more complicated one. However the nonlinear in- 
variants g(Y) = 0 of (8.1) defining the Lie group are replaced with linear invariants 
0 (站 (2) = 0defining the Lie algebra, and we know from Sect.IV.1 that essentially 
all numerical methods automatically conserve linear invariants. 

It follows from the proof of Theorem 7.1 that the solution of (8.1) can be written 
as 了 (一 exp(8( 信 ) 芳 , where Pb is the solution of 1 = dexpFl (4(Y(b))， 
12(0) = 0. Since it is not practical to work with the operator d expD 1， we truncate 
the Series (7.3) Suitably and consider the differential equation 
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2=4(exp(D) 苞 +》、 ad 包 (4(exp(9)2))， 2(0) =0.， (8.9) 
KE=1 


This leads to the following method. 


Algorithm 8.3 (Munthe- 开 aas 1999). Comsider 加 e proplem (8S.1) mw 4(7) <Eg 
JorY E G.A4Ssu11e 1 加 af yn liesy 六 加 e Zie 81otp GT1em ie step 阅 上 1 
dejijzed as jiowsy: 


@ CO11Sider 1pe dijerential eqguation (8.9) 1 太 区 108stead of yo, ad apPpiy da Ru1a8ge 一 
KBtta 1aetpod (explicit or DiP1ic 动 1o geta1z apProxiaation 121 福 12( 几 )， 
@ 1 太 he1l dejije 加 e 11U11eFical SOULiOn Dy 1 一 exp((21) 防 . 


Before analyzing this algorithm, we emphasize its close relationship with Algo- 
Tithm $.3. In fact, 放 we identify the Lie algebra g with 及 & (where Kis the dimension 
of the vector space gj, the mapping V(D) = exp( 人 1) 到 is alocal parametrization 
of the Lie group CG (see Lemma 6.4). Apart from the truncation of the series in (8.9)， 
Algorithm 8.3 is a Special case of Algorithm 3.3. 

Important properties of the Munthe-Kaas methods are given in the next two 
theorems. 


Theorem 8.4. Zer G pe a 1aatrix Lie 81op ad g 帮 Zie algebra. 夏 4(7) < 9 
JoryY eaGand tryo E G, tipen tpe 11ierical solufion of ipe ZLie 8g1oup 1ietpod oF 
41goritpjz 8.31ieyGie 也 EGJoral7=0,12.... 


Proof Itis sufficient to prove that for Yo E G the numerical Solution (421 of the 
Runge-Kutta method applied to (8.9) lies in g. Since the Lie bracket [2, 4] is an 
operationg xg 一 g,andsinceexp(f)YoeGfor2 sg,theright-hand expres- 
Sion of (8.9) is in g for (2 E g. Hence, (8.9) is a differential equation on the vector 
Space g with solution 12( 轨 E g. All operations in a Runge-Kutta method give 
results in g, so that the numerical approximation f21 also lies in g. 














Theorem 8.S. 矿 1Pe Rage-KUtta 711etpod 1 of (classica1) order D da11d 矿 1pe 1rU10- 
CQ1iO1 Idex 1 (6.9) Satisjiey dg 之 了 一 2 11e1 1je 1etpod or4lgoritpza 8.371 of 
O1der 7. 


Proof For sufficiently smooth 4(Z) we have 0 人 = 14( 加 ) + O( 人 2) YY = 
了 +TOWand[2g 轨 ,47O)] = O( 妈 ). This implies that ad 9 四 (4(7 执 )) 三 
O(t+l),sothatthetruncation ofthe series in (8.9) induces an error of size O(Pa+2) 
for | 直 和 凡 .Hence, for dg 十 2 > D, this truncation does not affect the order of 
Convergence. 














The most Simple Lie group method is obtained 让 we take the explicit Euler 
method as basic discretization and dg = 一 0in (8.9). This leads to the So-called Zrze 一 
尼 ULer 1Ietpod 

区 H1 = exp(P4( 攻 )) 隔 . (8.10) 


This is also a Special case of the Crouch-Grossman methods of Definition 8.1. 
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Taking the implicit midpoint rule as the basic discretization and again qd = 0 in 
(8.9), we obtain the Zie 1idpoit Fie 


到 1 = exp(D) 攻 ， =j4(exp(2/2) 邦 ). (8.11) 


This is an implicit equation in Y2 and has to be Solved by fixed point iteration or by 
Newton-type methods. 


了 xample 8.6， We take the coefficients of the right array of Table 8.1. They give rise 
to 3rd order Munthe-Kaas and 3rd order Crouch-Grossman methods. We apply both 
methods with the large step Size 玫 三 0.35 to the system (1.3) which is already of the 
form (8.1). Observe that 0 is a vector in 及 3 and not a matrix, but all results of this 
Section Temain valid for this case. For the computation of the matrix exponential we 
use the Rodrigues formula (Exercise 17). The numerical results (first 1000 steps) are 
Shown in Fig.8.1. We seethat the numerical solution stays on the manifold (Sphere)， 
but on the Sphere the qualitative behaviour is not correct. A Similar behaviour could 
be observed for projection methods (the orthogonal projection consists Simply in 
dividing the approximation YY +1 by its norm) and by the methods based on local 
coordinates. 


Crouch-Grossman methods and Munthe-Kaas methods are very Similar. If they 
are based on the Same Set of Runge-Kutta coefficients, both methods use s evalu- 
ations of the matrix 4(). The Crouch-Grossman methods require in general the 
computation of s(s 十 1)/2 matrix exponentials，whereas the Munthe-Kaas meth- 
ods require only s of them. On the other hand, Munthe-Kaas methods need also the 
computations of a certain number of commnutators which increases with g in (8.9). 
In Such a comparison one has to take into account that every classical Runge-Kutta 
method defines a Munthe-Kaas method of the Same order, but Crouch-Grossman 
methods of high order are very diffticult to obtain，and need more Stages for the 
Same order (ED > 人. 





Munthe-Kaas method 





Fig. 8.1. Solutions of the Euler equations (1.4) for the rigid body 
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IV.8.3 Further Coordinate Mappings 


The methods of Algorithm 8.3 are based on the local parametrization W(2) = 三 
exp(T2) 了 .For all Lie groups, this is a diffeomorphism between the Lie group and 
the corresponding Lie algebra. Are there other computationally more effcient para- 
metrizations that can be used in Special Situations? 


The Cayley Transform. Lie groups of the form 
G={t7lY7PY=P}， (8.12) 


where 已 is a given constant matrix, are called quadratic ZLie 81oUDS. The corre- 
sponding Lie algebra is given byg = {Q PC2+OLP = 0}. The orthogonal 
group O(m) and the symplectic group Sp(m) are prominent Special cases (see Ta- 
ble 6.1). For such groups we have the following analogue of Lemma 6.4. 


Lemma 8.7. For a guad1rafic Zie 81oP @， 1 加 e Cayiey 11ra1zs1o7771 

cay2 =(T 一 DO)-IT+I) 
11GDS ele111e11g or g ito G. Moreovep 11 aocal dijeomorpjis1a iear 02 = 0. 
Proof ForP cgke,P2+PIP=0wehavePI+TOO=(T-D)7Pand 


also PT- 所 )-1=(T+DTTPEFory=(T-D)-I0C+D)this immediately 
implies Y7 PY = 书 . 














The use of the Cayley transform for the numerical integration of diffterential 
equations on Lie groups has been proposed by Lewis 儿 Simo (1994) and Diele， 
Lopez 儿 Peluso (1998) for the orthogonal group, and by Lopez 久 Politi (2001) for 
general quadratic groups. It js based on the following result which is an adaptation 
of Lemma IIU.4.1 and Lemma IIU.4.2 to the Cayley transform. 


Lemma 8.8. 771e derivative of cayf2 1 give Dy 


(二 cay 2) 忆 = (dcay po( 酉 ) cay 92， 


WPeye 


dcaypo( 百 ) =2(0T 一 D) 开 万 十) (8.13) 


Fortipe iverse of dcay 2 we jave 
dcay Fl 人 万) = 5(- DTUT+ D)， (8.14) 
Proof By the usual rules of calculus we obtain 


(二 cay 2) 刀 =(T-D-II-OTIO+D+U- D)-IR 











and a Simple algebraic manipulation proves the statements. 
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The numerical approach for Solving (8.1) in the case of quadratic Lie groups 
is an adaptation of the Algorithm 8.3. We consider the local parametrization Y 一 
UW(D) = cay( 人 1) 世 ,and weapply one step ofanumerical method to the differential 
equation f2 = dcay D4( cay() 丈 ) which, by (8.14), is equivalent to 


已 = 3 谤 O)4( cay (2) 瑟 )( 上 + O)， 


This equation replaces (8.9) in the Algorithm 8.3. Since no truncation of an infinite 
Sefies li$ neceSSary here, this approach is a Special case of Algorithm 5.3. 


Canonical Coordinates of the Second Kind，For a basis {Ci, Co,. ,Co of 
the Lie algebra g the coordinates 21,...,zu of the local parametrization 录 (z) 一 
exp(? 2Ci) of the Lie group CG are called ca1zo7mical coo1dizzates or ipe Jst 
[11d. Here we are interested in the parametrization 


U(z) 一 exp(21C1) exp(z2Co2) 瑟 0 exp(zdCu)， (8.15) 


and we call z = (2 .2d)7 canonical coordinates oftjpe secomnd iad (Varadara- 
jan 1974). The use of these coordinates in connection with the numerical Solution 
of differential equations on Lie groups has been promoted by Celledoni 儿 Iserles 
(2001) and Owren 多 Marthinsen (2001). The idea behind this choice ls that, due to 
a Sparse Structure of the Ci, the computation of exp(z21C1), ,exp(zdzCu) may be 
much cheaper than the computation of exp(> ,; 2iCi). 

With the change of coordinates y = W(z), the differential equation (8.1) be- 
comes 录 (2)3 = 4((V(2))%(2)， which is equivalent to 


dd 
4(W%(2)) 一 >》， 入 exp(21C1) 和 exp(2i-1Ci 1) 
.Ci exp( 一 2-_1Ci_1) .exp( 一 21C1) (8.10) 
大 ( 焉 oo 瓦 -1)Ci 


| 
[Ms 


Where we use the notation 17C = exp(2j1C1) Cexp( 一 2Cj) forthe linear operator 
1 :8 一 gseeExercise 12. We need to compute 习 , ,2 from (8.16), and this 
will usually be a computationally expensive task. However for several Lie algebras 
and for well chosen bases this can be done very effticiently. The crucial idea is the 
following: we let 瓦 be defined by 


六 训 二 (8.17) 


肌 CE ii> 
CI， ii< 


and we assume that 


(下 oo 下 CE= (下 oo 而 CO， =2.d. (8.18) 
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Under this assumption，we have (五 6 瓦 -1)C; 全 ( 翅 5 1)Ci 大 
( 互 oO ...oO 已 Cu and the relation (8.10) becomes 


QI 


( 仙 。.…。 遍 1( > 2) = 4(%(3) (8.19) 


In the situations which we have in mind, the operators 态 can be effciently inverted， 
and Algorithm 5.3 can be applied to the Solution of (8.1). 

The main difficulty of using this coordinate transform is to fnd a Sultable or- 
dering of a basis Such that condition (8.18) is Satisfied. The following lemma Sim- 
plifties this task. We use the notation ak(C) for the coefficient in the representation 


C= 二 5-iak(C)Cr. 


Lemma 8.9. Let {C1 .Co pe apasis of tpe Lie algepja g. 矿 Jor every pair 
IT<iandjJor8<7TweHave 


ak(7Ci) 天 0 一 全 Fe 一 CR Jorlsaishig KE<L<7 (8.20) 
pen ijpe 1elation (8.718) po1dsy 1jor al11 三 2 .04. 


Proof We write 瓦 _1C， = FL1Ci = > 和 ok( 1Ci)CK. Ifolows from the 
definition of 耗 and from (8.20) that ( 瓦 _>o 态 1)C; 一 ( 忆 -2o 甩 -1)C5.Arepeated 
application of this argument proves the statement. 














Owren 久 Marthinsen (2001) have studied Lie algebras that admit a basis Satis- 
fying (8.18) for all >. We present here one of their examples. 


了 上 xample 8.10 (Special Linear Group). Consider the differential equation (8.1) 
on the Lie group SL(p) = { 立 | detyY = 1},ie.,the matrix 4(Y) lies in 5[(m) 一 
{41trace4 = 0}. As abasis of the Lie algebra 5[(m2) we choose Pij = ezef for 
1 天 妃 and Di = eiel 一 ei+1le1 for1 < ii <7(hereei 一 (0 1. .0) 并 
denotes the vector whose only non-zero element is in the ?th position). Following 
OWwren 久 Marthinsen (2001) we order the elements of this basis as 


Bi2< .< Enm<PEo3< .<Eon< .< 已 1n 
< Bali<...<Eni<Esz<.. .<Pna< .< 已 nn-1l 
< .Di<.. .< 也 1. 


With the use of Lemma 8.9 one can check in a straightforward way that the relation 
(8.18) is satisfied. In nearly all Situations ak (7C5) = 0forE < 了 < isSothat 
(8.18) represents an empty condition. Consequently, the 为 can be computed from 
(8.19). Due to the sparsity of the matrices 有 ij and Di the computation of 开 Can 
be done very effciently. 
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IV.9 Geometric Numerical Integration Meets 
Geometric Numerical Linear Algebra 


The persistent use of orthogonal transformations is a hallmark of numerical linear 
algebra. Correspondingly, manifolds incorporating orthogonality constraints play 
an important role all over this feld; see Edelman，Arias 儿 Smith (1998) on the 
geometry of algorithms with orthogonality constraints. In addition to the orthogonal 
group O(m), the manifolds of primary interest are: 


)mk，the Stiefel manifold of 7 x matrices with K orthonormal columns， 

9 k，the Grassmann manifold of orthogonal projections of 及 ”onto K-dimensional 
Subspaces, and 

1 the manifold of mm x 7 matrices of rank 5， which is related to orthogonal 
transformations via the Singular value decomposition and a related decomposi- 
tion discussed below. 


IV.9.1 Numerical Integration on the Stiefel Manifold 


The original motivation for Stiefel Manifolds 
(in 9tiefel 1933) was the topological problem， 
whether a manifold A4 can possess K everywhere 
linearly independent continuous vector fields. The 
problem, which had been solved for the case 8 三 
1,， was much harder for g > 1. In order to attack 
this question, Stiefel introduced his” manifold 





Ri|Y7Y 三 丰 (9 


as an auxiliary tool for the definition of what later 
became known as the Stiefel-Whitney classes6. 

Here，we are interested in computations on 
these manifolds for their own，with many appli- 
cations, as for example the computation of Lya- 
punov exponents of differential equations; See Ex- 
ercise 22 as well as Bridges 儿 Reich (2001) and Dieci Russell 儿 van Vleck (1997). 
There are also many cases where orthogonality constraints concern only Some of 
the variables in a differential equation. In molecular dynamics, for example,， such 
orthogonality constraints arise in the Car-Parrinello approach to GaP 7ito molecu- 
lar dynamics (Car 贸 Parrinello 1983) and in the multiconfguration time-dependent 
Hartree method of quantum molecular dynamics (Beck, Jackle，Worth 人 贸 Meyer 
2000). 


队 
NAN 


Eduard Stiefel5 


5 Eduard L. Stiefel, born: 21 April 1909 in Zrich, died: 23 November 1979; photo: Bil- 
darchiv ETH-Bibliothek, Ziirich. 
5 We are grateful to our colleague A. Haefliger for this indication 
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Tangent and Normal Space. We choose a fixed matrix Y in the Stiefel manifold 
7 = ]m5. Then the tangent Space (5.4) at Y E Y consists of the matrices 2 such 
that ( 十 =2)7(7 + <s2) remains Tfor es 一 0. Differentiating we obtain 


TVY={2ZeR"x|277+Y72Z = 0}， (9.2) 


ie.,Y7Z is Skew-Symmetric. This Tepresents 二 大 (大 十 1) conditions, thus Th is of 
dimension mK 一 起 (有 十 了 ). 
For defining the normal Space, we use the standard Euclidean inner product on 
了 有 ?2xR， ie.， 
(4,B) = trace(47 太 ) = i ai7pi7， (9.3) 


Whose corresponding norm is the Frobenius norm 


Ilm = V，o2 (9.4) 


Then the normal Space at Y is given by 
PNyY={KER7" 人 | 开 LT ={79S|59symmetricKE xmatrix}， (9.5) 


To show this, we observe thatthe orthogonalityY 9 上 Tym7 follows from (了 9, 2 = 
trace(SY72) = (9,Y72) and the fact that any symmetric matrix 4 is orthogonal 
to any Skew-symmetric matrix 妃 .”" A dimension count (the matrix S has 二 大 ( 天 十 二 ) 
free elements) now Shows us that the Space defined in (9.5) fjlls the entire orthogonal 
complement of 7Ty-)7. 


Orthogonajlity-Preserving Runge- 玫 utta Methods. Suppose now that we have to 
Solve a differential equation 立 王 下 () on a Stiefel manifold 7. The orthogonality 
constraints Y7Y 一 Tare preserved, ifthe derivative 玉 (Y) lies in the tangent space 
TD ie 让 已)TY +Y7RF(Y) = 0, for every 了 E 7Y (weak invariants，see 
Sect.IV.4). In the (exceptional) case where they are in fact true invariants,， 1l.e., 让 
瓦 (了 ) 科 十 YLRFO7) =0fora1 六 e 玉 "xA,then the orthogonality constraints are 
quadratic, and are therefore preserved exactly by the implicit Runge-Kutta meth- 
ods of Sect.IV.2.1, in particular the Gauss methods. These methods give numerical 
solutions on the Stiefel manifold, but use function evaluations outslde the manifold. 
In the general case of only weak invariants, a standard approach for enforcing 
orthogonality is the introduction of Lagya1zge 1IULiPLiers, which can be interpreted 
as artificial forces in the direction of the normal Space keeping the solutions on the 
manifold. Due to the structure of Ny) (See (9.3)), the problem becomes here 


立 = FJ)+Y4，Y7Y=T (9.0) 


with a symmetric Lagrange mnultiplier matrix 4 E 了 8xz; see also Exercise 10. 
Any numerical method for differential-algebraic equations can now be applied, e.g.， 


7 Indeed, split the sum in (9.3) in two partsz < 7 andi > 记 and interchangei < Jinthe 
Second sum. Then both sums are identical with opposite Sign. 
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appropriate Runge-Kutta methods as in Chap.VI and Sect.VJIL.4 of Hairer 儿 Wan- 
ner (1990). A Symmetric adaptation of Gauss methods to Such problems is given by 
Jay (2005). 

Below we shall study in great detail mechanical Systems with constraints (See 
Sect.VIL1). In the case of orthogonality constraints, Such problems can be treated 
Successfully with Lobatto IIA-IUB partitioned Runge-Kutta methods, which in ad- 
dition to orthogonality preserve other important geometric properties Such as Te- 
VerSsibility and Symplecticity. 











(a) | (bp) 


Fig. 9.1. Projection onto the Stiefel manifold using the Singular value decomposition 


了 Projection Methods. If we want to use the projection method of Algorithm 4.2, we 
have to perform, after every integration Step, the projection (4.4), which requires to 
fnd for any given matrx YY amatrx yy E with 


| 严 一 交 | 一 min (9.7) 


This projection can be obtained as follows: 让 yis not in 7 (but close), then its 
column vectors , .人 久 will have norms different from 1 and/or their angles 
will not be Tight angles. These quantities determine an ellijpsoid, 让 we require that 
these vectors represent conjugate diameters 8 (see Fig. 9.1 (a)). This ellipsoid is then 
transformed to principal axes in 及 & by an orthogonal map U7 (picture (b)). We let 
01;,...,Ok be the length of these axes. Ithe coordinates are now divided by ci， 
then the ellipsoid becomes the unit sphere and the vectors U7 芒 become orthonor- 
mal vectors L7 工 7 These vectors, when transformed back with ,lie in 7” and are 
the projection we were Searching for (Picture (c)). For a proof of the optimality, See 
了 Exercise 21. 


Co111ectio1 Witi 1Pe 9108UGr Wue Decomtpositio1. We have by construction that 
TILW 三 了 -107 芒 where 研 = diag(ol,. .ob).Iwefinally map these vectors by 
an orthogonal matrix Y to the unit base, we See that 了 之 -177= or 


艺 = TDVT (9.8) 


Which is the Si12gMLar Value deco11Ipositiom of 区. This connection allows us to use 
standard software for our calculations. The projected matrix is then Y = TV 


8 Here we touch another of Stiefel's great ideas, the CG algorithm. 
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Re110K 7. When the differential equation posSesses Some SYymmetry (See the next 
chapter), then the yz111etric projection algorithm V.4.1 ls preferable to be used 
instead. 


ReraaK 2. The above procedure is equjivalent to the one proposed by D. Higham 
(1997): the orthogonal projection is the first factor of the polar decomzpositioz 了 一 
工 甩 (where Y has orthonormal columns and is Symmetric positive definite). The 
equivalence is seen from the polar decomposition 了 = (CQVZ(VZV7T). Arelated 


procedure,，where the first factor of the C 有 decomposition of Y is used instead of 
that of the polar decomposition, is proposed in Dieci, Russell & van Vleck (1994). 


Tangent Space Parametrization. For the appli- 
cation of the methods of Sect.IV.3, in particular 了 Pr() 
Subsection IV.9.3, to the case of Stiefel manil- 也 
folds，we have to find the formulas for the pro- 
jection (9.8) (see the wrap figure). 
Foraftxedy ,letyY 二 2beanarbitrary matrix 
inyY 十 Ty),for which we search the projection 
Wy(2) to 7. Because of the structure of JVyT 
(See (9.9)), we have that 





wy(2) = 并 十 和 二 YS (9.9) 


is a local parametrization of 7, 让 9 is symmetric and 这 wy(2)7wWy(2) = 工 This 
condition, when multiplied out, Shows that 9 has to be a solution of the algebraic 
Riccati equation 


S2 二 2S5+5SY712+ZTYS+2ZIZ = 0. (9.10) 


Observe that for 8 三 1, where the Stiefel manifold reduces to the unit Sphere in 
及", the equation (9.10) is a scalar quadratic equation and can be easily Solved. For 
到 > 1, it can be Solved iteratively using the scheme (e.g., Starting with 90 = 0) 

(IT 十 27Y)S 十 SIT+Y72) =-217 一 92 | 
Using a Schur decomposition Y72 = Q7LRQ (where Q is orthogonal and 尺 upper 
triangulan, the elements of Q.S,Q7L can be computed successively starting from the 
left upper corner. We refer to the monograph of Mehrmann (1991) for a detailed 
discussion of the Solution of linear and algebraic Riccati equations. 

Next we compnute for the matrix 已 its orthogonal projection Py( 五 ) to Ty， 
ji.e., by (9.$), we have to find a symmetric matrix S such that Pr (四 ) 三 下 一 六 5. 
The tangent condition Py(F)TYTYTPvr(P) = 0leadstoS5= (FITY+TYT7P/2， 
SO that 

Pyr(Z = 古 一 人 十 YY 站). (9.11) 


With the parametrization wyr(2) of (9.9) the transformed differential equation， 
when projected to the tangent Space, yields 


纺 = PrF(yyv(2))， (9.12) 
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in complete analogy to (3.9). The numerical solution of (9.12) requires, for every 
function evaluation, the Solution of the Riccati equation (9.10) and the computation 
of a projection onto the tangent space, each needing O(mk2) operations. Compared 
with the projection method, the overhead (i.e., the computation apart from the evalu- 
ation of 严 ( 疡 )) is more expensive, but the approach described here has the advantage 
that all evaluations of 严 are exactly on the manifold )”. 


TIV.9.2 Differential 上 Equations on the Grassmann Manifold 


The Grassmann manifold is obtained from the Stiefel manifold by identifying ma- 
trices in )]m,k that Span the Same Subspace (See Fig.9.2 (a)). 9ince any two Such 
matrices result from each other by right multiplication with an orthogonal 太 X 无 
matrix, the resulting manifold is the quotient manifold 


9 三 2 KE/O(UE). (9.13) 


An equivalence class [了 E 9 defines an orthogonal projection 书 三 YY7 of 
Iank 5, and conversely, every orthogonal basis of the range of 尸 yields a represen- 
tative Y E )mk. We can thus view the Grassmann manifold as 


(9.14) 


书 is an orthogonal projection onto } 
aK-dimensional Subspace of 豚 ” 


5 





Fig. 9.2. Integration of a differential equation on the Grassmann manifold 


The Tangent Space. The map Y 一 尸 =YY7 fromm 一 9 has the tangent map 
(derivative) ? 


TyyY 一 Tp9: 07 一 6P=577Y77 十 YY (9.15) 


and we wish to apply all the methods for 7Yy) from the arsenal of the preceding 
Section to problems in TPG9. However, the dimension of TPG is by 二 人 天 一 革 ) lower 
than the dimension of Ty ?77. This difference is the dimension of O(E) and also of 


9 Here we write 6Y for tangent matrices at Y (what has been 2 in (9.2)), and similarly 
for other matrices; Lagrange'”s 0-notation here becomes preferable, since we will have， 
especially in the next subsection, more and more matrices moving around. 


130 IV，Conservation of First Integrals and Methods on Manifolds 


5$0([), the vector space of skew-symmetric K x K matrices. The key idea is now 
the following: 证 we replace the condition from (9.2), Y76Y skew-symmetric, by 
Y76Y = 0, then we remove precisely the superfluous degrees of freedom. Indeed， 
the extended tangent map 


TYy7 一 TPp9x5o( 和 :6 关羽 (OY77 十 Y67 7 57) (9.16) 


is an isomorphism, Since it is readily Seen to have zero null-space and the dimensions 
of the vector Spaces agree. The tangent Space is thus characterized as 


7TpP9={6P=67Y7+Y6Y7|1Y757 = 0}， (9.17) 


and every 6P < TP9 corresponds to a unique 6Y with Y76Y = 0. Note that this 
condition on 60Y does not depend on the representative 了 of [到 ]. 


Differential Equations. Consider now a differential equation on 9， 
已 = G(CP)， (9.18) 


with a vector field C on 9. The condition CI( 忆 ) e 7TP9G9 means, since the tangent 
map (9.15) is onto, that there exists for 已 = YY7 a vector 严 ( 了 ) such that 


G(P)=FODYT +TYPFOI wwibh 到 +Y7TRP=0 (9.19) 


ie., 下 (了 ) E 7Ty7. Howeven from a given initial position Y, there are many 已 
which produce the same movement C of the subspace represented by 书 (see Fig.9.2 
(b)). By (9.16),the movement of Y becomes unique 让 werequire that this movement 
is o1tjpogo1al to the Subspace (See Fig.9.2 (c))， 


Y7Y = 0. (9.20) 


Multiplying the derivative 忆 一 这 十 YYT with Y7 from the left we obtain， 
under condition (9.20), Y7P=Y7 and， by (9.18) and (9.19), Y 三 YFTT 十 已 or 


立 = (IT 一 YY 人 ) 忆 2)， (9.21) 


Geometrically, this means that the vector 下 ( 光 )，which could be chosen arbitrarily 
in7Yy),is projected to the orthogonal complement of the Subspace Spanned by 了 or 
书 王 YY7. The derivative Yin (9.21) is independent of the particular choice of 亚 . 

Equation (9.21) is a differential equation on the Stiefel manifold 7” that can be 
Solved numerically by the methods described in the previous Subsection. 


了 上 xample 9.1 (Oja Flow). A _ basic example arises in neural networks (Oja 1989): 
Solutions on )2, of the differential equation 


立 = (一 YY7T)4Y (9.22) 


with a constant Symmetric positive definite matrix 4 E 耻 " ”tend to an orthogonal 
basis of an invariant Subspace of 4 as 上 一 co (Yan, Helmke & Moore 1994). 
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Anaive comparison of this equation with (9.21) would lead to 下 () = 4Y, but 
this function does not satisfy the tangent condition F7Y 十 Y7 尺 = 0from (9.19). 
So we use the fact that (了 一 YY7)2 = 了 一 YY7 and set 忆 (7) = (一 YY 人 247. 
With this, G( 己 ) ffom (9.18) and (9.19) becomes 


疡 =(T-P 六 4P+P40T- 亡 ). (9.23) 


We have obtained the result that equation (9.22) can be viewed as a differential 
equation on the Grassmann manifold OK. 
However for the numerical integration it ls more practical to work with (9.22). 


IV.9.3 Dynamical Low-Rank Approximation 


Low-rank approximation of large matrices ls a basic model reduction technique in 
many application areas, Such as image compression and latent Semantic indexing in 
information retrieval; See for example Simon 儿 Zha (2000). Here, we consider the 
task of computing low rank approximations to matrices 4(t) E 琢 内 xm depending 
Smoothly on 刀 At any time 妃 a best approximation to 4(b of rank Ris a matrix 
Xinthe manifold 人 AMfz = 人 ”ofrank-K matrices that Satisfies 


X 人 (be suchthat |X 办 一 4 = minl (9.24) 


The problem is solved by a singular value decomposition of 4(, truncating all 
Singular values after the 7 largest ones. When the matrix ls So large that a complete 
Singular value decomposition ls not feasible, a standard approach to obtain an ap- 
proximate solution is based on the Lanczos bidiagonalization process with 4( 轨 ,as 
discussed in Simon 儿 Zha (2000). 

Following Koch 久 Lubich (2005)，we here consider instead the low-rank ap- 
proximation 世 ( 力 E 人 fk determined from the condition that for every t the deriva- 
tive 立 (, which is in the tangent space TYy(D 人 fk, be chosen as 


YY 人 ETyroDAK suchthat | 立 昌 一 4 = minl (9.25) 


This is complemented with an initial condition, ideally 痰 (如 ) = X(to). For given 
立 ( 划 ,the derivative Y(b is obtained by a 1inear projection, though onto a solution- 
dependent vector space. Problem (9.25) yields a differential equation on 人 4 We 
will see that with a suitable factorization of rank-K matrices，we obtain a System 
of differential equations for the factors that ls well-suited for numerical integration. 
The differential equations contain only the increments 4()， which may be much 
Sparser than the full data matrix 4 ( 萎 . 

Koch 廊 Lubich (2005) show that 了 (tb yields a quasi-optimal approximation 
on intervals where a good Smooth approximation exlsts. It must be noted, however， 
that the best rank-8 approximation X (加 may have discontinujities，which cannot 
be captured in 节 ( 坟 . This is already seen from the example of finding a rank-1 
approximation to diag(e-t et where starting from 如 < 0 yields 匡 ( 力 = 节食 三 
diag(e- 罗 0) fort < 0,buty 信 = diag(e-0) andXO = diag(0,eb fort > 0. 
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The best approximation X(t has a discontinuity at 上 = 0, caused by a crossing of 
Singular values of which one is Inside and the other one outside the approximation. 
An algorithmic remedy is to restart (9.25) at regular intervals. 

In contrast to (9.24), the approach (9.25) extends immediately to the low-Trank 
approximation of solutions of matrix differential equations 4 = 下 (4). Here， 
4(b in (9.25) is simply replaced by the approximation 严 ( 了 (办 ), which yields the 
minimum-defect low-rank approximation Y(t) by choosing 


YeETyAk suchthat | 立 一 下 (2 = minl (9.26) 


An approach of this type ls of common use in quantum dynamics,， where the phys- 
ical model reduction of the multivariate Schrodinger equation by the analogue of 
(9.20) ls known as the Dirac-Frenkel time-dependent variational principle，atfter 
Dirac (1930) and Frenkel (1934); see also Beck, Jackle，Worth 久 Meyer (2000) 
and Sect. YI.0. 


Decompositions of Rank- 上 Matrices and of Their Tangent Matrices. Every real 
Tank-K matrix of dimension 7 x 7 can be wiitten in the form 


科 = USV (9.27) 


Where 7 E )mk and YY E ) 思 5 have orthonormal columns, and 9 E 有 R&xR is 
nonsingular. The singular value decomposition yields 9 diagonal, but here we do 
not asSsume a Special form of 5. The representation (9.27) is not unique: replacing 
UVbyU7=VPandvbyyYy =YQwith orthogonal matrices 已 CQ E O(K) and 
Correspondingly 9 by S-==- P715Q, yields the same matrix 和 = USV7L = 15V7. 

As a Substitute for the non-uniqueness in (9.27),， we use - as in the previous 
Subsection - a unique decomposition in the tangent Space. Every tangent matriX 
0O7Y ETyrA at =USVYTis ofthe form (see Exercise 23) 


0 一 6U5SVT TU5SV 工 十 US5VT， (9.28) 


Where 59 < 有 8xe and 6U E Tri 6 E Tri Conversely, 65,5U, 5V are 
uniquely determined by 0Y 让 we impose the orthogonality constraints 


UV75U7 =0， VY76Y = 0. (9.29) 
Equations (9.28) and (9.29) yield 


69 = TU767T 
6U = (T 一 TU)5YVS-L， (9.30) 
6 忆 7=(T-YV7T)77U5S 一 . 


Formulas (9.28) and (9.30) establish an isomorphism between the Subspace 
{(65,6U 7) E 了 xx 有 mx xx 了 "xx|1177607 =0,V757 =0} 


and the tangent Space TYyA 人 4 
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Differential Equations for the Factors. The minimization condition (9.25) is 
equivalent to the orthogonal projection of 4( 为 onto the tangent space Ty(DA4k: 


find y < 7TyA 人 KK (we omit the argument 为 Satisfying 
( 袜 一 4,6Y7)=0 forall 7 ETyrAh， (9.31) 


with the Frobenius inner product (4, B) = trace(47 成 ). With this formulation we 
derive differential equations for the factors in the representation (9.27)， 


Theorem 9.2， For 了 = VSVYT E AR wii nonsinguiar 9 E 要 28x ad wiip 
DERmxeand YE 可 "x jaVviz28 OPo11OT1IGL coO1U112129 COGifiO1 (9.25) 0r(29.37) 
iedguivalentio 和 = 一 USVT UV 上 +TUSVT where 

上 UV74T 

TI = (U-TITD4V5S-1i (9.32) 

7 = (U-TYYVYDD47U05-7 


Proof ForwE 有 mm, VER"and 巨 ER weusethe identity 
(ui 了 B) = Bu 


In view of (9.29) weredquire VIU =TYT7 =0 along the solution trajectory in order 
to define a unique Tepresentation of Y. We first substitute 5Y 一 2 了， for ?7，7 三 
1 .jin(9.31), where ivj denote the columns of Z，V ,respectively. This is of 
the form (9.27) with oV = 0Y = 0 and one non-zero element in 09. In this way we 
find 9 = UVT4V Similarly choosing 6Y = 六 | 0 si7j07 ， 1 一 1 ...,K， Where 
6uU E 月 岂 is arbitrary with UL6w = 0, we obtain the stated differential equation 
for V, and likewise for 0Y 一 民生 7S7i6U7 with VY76u = 0 the differential 
equation for 了 . 














The differential equations (9.32) are closely related to differential equations for 
other Smooth matrix decompositions, in particular the smooth singular value decom- 
position; See, e.g., Dieci & Eirola (1999) and Wright (1992). Unlike the differential 
equations for singular values given there, the equations (9.32) have no Singularities 
at points where singular values of 世 ( 贡 coalesce. 

For the minimum-defect low-rank approximation (9.26) of a matrix diffterential 
equation 4 = 下 (4), we just need to replace 4 by F(Y) forY = USVT in the 
differential equations (9.32). 

The matrices Land Y( 为 evolve on Stiefel manifolds. The differential equa- 
tions (9.32) can thus be Solved numerically by the methods discussed in Sect.IV.9.1. 
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1.， Prove that the Symplectic Euler method (1I.1.9) conserves quadratic invariants 
of the form (2.3). Explain the “0”entries of Table (1.2.1). 
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. Prove that under condition (2.3) a Runge-Kutta method preserves all invariants 


ofthe form 7(J) 三 VCVY 二 dy 二 c. 


.Prove that an s-stage diagonally implicit Runge-Kutta method (i.e., ai 三 0 for 


1z<7I)satisfies the condition (2.3) 寺 and only ifit is equivalent to a composition 
2ojo...o9gonbased onthe implicit midpoint rule. 


. Prove the following statements: a) Ia partitioned Runge-Kutta method con- 


Serves general quadratic invariants DZCp 十 2D7 .Do 十 oo, then each of the 
two Runge-Kutta methods has to conserve quadratic invariants Separately. 

b) 开 both methods, {pi, ai7} and 人 2 are irreducible, satisfy (2.3) and 让 
(2.7)-(2.8) hold, then we have 5 一 已 and aij 一 Qi for all 7,7. 


.Prove that the Gauss methods are the only collocation methods Satisfying (2.3). 


DA Use the jdeas ofthe proof of Lemma 13.9 in Hairer 儿 Wanner (1990). 


. Discontinuous collocation methods with either bi 夭 0 or b。 天 0 (Defini- 


tion JI.1.7) cannot Satisfy the criterion (2.3). 


. (9anz-Serna 久 Abia 1991, Saito, Sugiura 儿 Mitsui 1992). The condition (2.3) 


acts as Simplifying assumption for the order conditions of Runge-Kutta meth- 
ods. Assume that the order conditions are Satisfied for the trees and v. Prove 
that it ls Satisfied for ww ov 丰 and only ifitis satisfed for vo ,and that it ls 
automatically Satisfied for trees of the form  o ?. 

Re1a1K ovw denotes the Butcher product introduced in Sect. VI.7.2. 


. 了 Zo is a symmetric，tridiagonal matrix that is suffticiently close to 4 = 


diag(Al,...,Xn), where Al > Xa > ...> 》Xnaretheeigenvalues of Zo, then 
the solution of (3.5) with 妃 ( 厂 ) = 也 + 一 7 converges exponentially fast to the 
diagonal matrix 4. Hence, the numerical solution of (3.5) gives an algorithm 
for the computation of the eligenvalues of the matrix 了 0. 

BE Let DO be the entries in the diagonal of 也 and a1 ,an 1 
those in the subdiagonal. Assume that |Bj(0) -- XAz| 和 尺 /3 and |ax(0)| 蔷 慌 
with Some sufficiently Small 尺 . Prove that 厌 昌 一 Bi 之 一 尺 and 
ak 人 (< Re-w-mtforalt > 0, where 风 一 mink(X 一 Xe) > 0. 


. Elapborate Example 4.3 for the Special case where Y ls a matrix of dimension 


2. In particular show that (4.60) ls the Same as (4.9), and check the formulas for 
the Simplitied Newton iterations. 

(Brenan，Campbell 儿 Petzold (1996)，8Sect.2.5.3). Consider the differential 
equationy = jy) with known invariants g(y) = Comst and assume that 9 (y) 
has full rank. Prove by differentiation of the constraints that, for initial values 
satisfying 9(y0) = 0, the solution of the differential-algebraic equation (DAE) 


六 = jg) 上 9 
0 = 909) 


also solves the differential equation y 下 )， 

RemaK Most methods for DAESs (e.g., stiffly accurate Runge-Kutta methods 
or BDF methods) lead to numerical integrators that preserve exactly the con- 
straints 9g(y) = 0. The difference from the projection method of Sect.IV.4 is 
that here the internal stages also Satisfy the constraint. 


11. 


12. 


13. 


14. 


15. 


10. 


17. 


18. 


19. 
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Prove that SL(P) is a Lie group of dimension mn2 -- 1, and that 5[(P) is its Lie 
algebra (See Table 6.1 for the definitions of SL(Pz) and 5[(7z)). 
Let CG be a matrix Lie group and g its Lie algebra. Prove that for 了 E Gand 
4cgwehaveYy4yY-leg. 
已 AL Consider the path 7( 一 站 a( 世 7 一. 
Consider a problem 了 = 4(YZ)Y, for which 4(Y) e 5o(n) whenever YE 
O(mP), but where 4( 站 ) is an arbitrary matrix foryY & O(m). 
a) Prove that mm E O(P) impliesY(bEO(m) forall 世 
b) Show by a counter-example that the numerical solution of the implicit mid- 
point rule does not necessarily stay in O(7m). 
(Feng Kang 人 Shang Zai-jiu 1995). Let 尺 (z) = (1 二 z/2)/(1L ->z/2) bethe 
stability function of the implicit midpoint rule. Prove that for 4 e 5[(3) we 
have 

det RPR4) =1 今 det4=0. 


(Iserles 久 Ngrsett 1999). Introducing yi 三 V and ya2 三 ,write the problem 


少 十 怒 = 0， 4(0) = |， 2(0) =0 


in the form (7.0). Then apply the numerical method of Example 7.4 with dif- 
ferent step Sizes on the interval 0 冬 二 和 100. Compare the result with that 
obtained by fourth order classical (explicit or implicib Runge-Kutta methods. 
RemaaK 开 4( 均 in(07.6) (or 4( 坊 V) in(8.D) are much smoother than the solu- 
tion y( 轨 ,then Lie group methods are usually superior to standard integrators， 
because Lie group methods approximate 4( 如 ,whereas standard methods ap- 
proximate the solution y( 妇 by polynomials. 

Deduce the BCH formula from the Magnus expansion (IV.7.5). 

届 L For constant matrices 4 and 已 consider the matrix function 4( 为 defined 
by 4 人 归 =Bftfor0<t< 和 land4 电 =4forl<t<2. 

(Rodrigues formula, see Marsden & Ratiu (1999), page 291). Prove that 


2 0 一 人 CU2 
) 22 for 人 2 王 LI3 0 一 以 于 
一 (2 CU1 0 





Sin CQ 1 /Sin(Q/2 
exp(D) = 7 十 一 2+ 了 人 ( 站 





where aw = Vwi 十 wz 十 w3. This formula allows for an efficient implementa- 
tion of the Lie group methods in O(3). 

The solution of 了 = 4( 切 ) 甩 YY(0) = 节 ,is given byY( = exp(2()2， 
where 1V2() solves the differential equation (8.9). Compute the first terms of the 
t-expansion of (2( 惑 . 

Resi 8 = 二 ( 哆 ) 十 生 4( 隔 )4( 葬 ) 硬 十 二 (4()24( 苞 ) 了 十 
录 ( 欧 )4( 鸣 2 区 十 4/( 苞 )(4( 苞 ) 殉 402) 瑟 ) 开 [4405)405) 巧 ]) 
Consider the 2-stage Gauss method of order p = 4. In the corresponding Lie 
group method, eliminate the presence of 2 in [12,4] by iteration, and neglect 
higher order commutators. Show that this leads to 
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21. 


22: 


23. 


24. 
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而 二 仙人 
工 
趟 


6 2 12 
人 


1 一 exp 人 (3 41 十 5 42) 必 LA 4a]jg， 


Where 4; = 4( 玫 ) and 到 = exp(( 人 2)y0. Prove that this is a Lie group method 
of order 4. Is it Symmetricy7 

In Zanna (1999) a Lie group method Similar to that of Exercise 19 is presented. 
The only difference is that the coefficients (一 1/12 十 V3/24) and (1/712 十 
V3/24) in the formulas for Pi and V22 are replaced with (一 5/72 十 V3/24) and 
(5/72 十 V3/24), respectively. Is there an error somewhere? Are both methods 
of order 47 本 

Show that for given 多 the solution of problem (9.7) 了 三 UL TV， where 
站 二 TUZV7T is the singular value decomposition of 了 . 

刷 iE Since |SVTIP = 8 holds for all orthogonal matrices UV and V， 
it is sufficient to consider the case 立 = (了 ,0)7 with 吕 = diag(ol,. ,an). 
Prove that |( 瑟 ,0)7 一 YY 人 > 半 2 (ci 一 12 for all matrices Y satisfying 
YX 三 元 

Show that the solution of the matrix differential equation Y 一 4 人 (by on 要 "xR， 
with initial values Yo E )2n,k，, Can be decomposed as 


7 人 提 = 三 芝 人 (Sb)， Where U ES E 顶 &xk 








12 





Satisfy the differential equations 
93S=U74US5， TU=(U-TUT4T 


with initial values 95o 三 7 Do = Y0. 

RemiaK These differential equations can be used for the computation of Lya- 
punov exponents as an alternative to the differential equations discussed in 
Bridges 儿 Reich (2001) and Dieci, Russell 久 van Vleck (1997). 

Consider the map GLKE) x mx 一 人 (thatassociates to (9, LT)the 
rank-K matrix y 一 USV7T. Show that the extended tangent map 


限 &x X 了 TrUrm 义 了 TV) 一 TY AR X 50( 丰 ) 义 50(K) 
(65,6U)57) mm (00ST 二 DT65STT +TS6VT TI76D 7 


jls an ljSomorphism. 

Let 4( 轨 E 下 "x” be symmetric and depend smoothly on 上 Show that the 
solution 已 ( 坟 <E 9 ofthe dynamical low-rank approximation problem on the 
Grassmann manifold， 


忆 ETp9 with |P=-4lm=minl 
is given as 已 二 YY7 where 了 <E )Jmk Solves the differential equation 


立 = (IT 一 YYZ)4T 


Chapter YV. 
Symmetric Integration and Reversibility 


Symmetric methods of this chapter and Symplectic methods of the next chapter Play 
a central role in the geometric integration of differential equations. We discuss Te- 
verSsible differential equations and reversible maps, and we explain how Symmetric 
integrators are Telated to them. We Study Symmetric Runge-Kutta and composition 
methods, and we Show how standard approaches for solving differential equations 
on manifolds can be Symmetrized. A theoretical explanation of the excellent long- 
time behaviour of symmetric methods applied to reversible differential equations 
will be given in Chap. XI. 


V.1 Reversible Differential 了 Equations and Maps 


Conservative mechanical Systems have the property that inverting the initial direc- 
tion of the velocity vector and keeping the initial position does not change the Solu- 
tion trajectory, it only inverts the direction of motion. Such Systems are “Teversible”. 
We extend this notion to more general Situations. 


Definition 1.1. Let p be an invertible linear transformation in the phase Space of 
= /四 .This differential equation and the vector field F(y) are called p-7reversiple 
这 

pjW) = 一 Ap forall y. (1.1) 








jy) 
如 7 忆 多 一 人 
0 0 5 和 
一 jy) 
Jpy) 
DOV 02V0 
Dj(y) 


Fig. 1.1. Reversible vector field (left picture) and reversible map (right picture) 
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This property is lustrated in the left picture of Fig. 1.1. For p-reversible differ- 
ential equations the exact fow wx(y) satisfies 


popot=p_iop=orlop (1.2) 


(See the picture to the Tight in Fig. 1.1). The right identity ls a consequence of the 
group property pr o ps = pi+s, and the left identity follows from 


李 (popi)(y) = oj(ee()) = -jos 20()) 
二 (p_iop)(y) = -1((e- o 0)(O)， 


because al] expressions of (1.2) satisfy the Same differential equation with the Same 
initial value (po po)j(y) = (poop)(y) = py. Formula (1.2) motivates the following 
definition . 


Definition 1.2. A map @(y) is called p-7reversiple 让 


po 瑟 =@ Lo0. 
卫 xample 1.3， An important example is the partitioned System 
勾 一 ju)， 局 一 9(uU)， (1.3) 
Where ju -v) = 三 一 帮 uV) and qg(u 一 v) = 9(uU). Here, the transformation p is 


given by pu V) 三 (2 一 四 . 和 we call a vector field or a map 7eversibple (without 
specifying the transformation P)，we mean that it ls D-reversible with this particu- 
lar 0. All second order differential equations 忆 = 9g(2) written as 忆 王 光 了 三 9 
are TeverSsible. As a first Implication of reversibility on the dynamics we mention 
the following fact: 寺 and v are Scalar and 让 (1.3) is reversible, then any Solution 
that crosses the -axis twice is periodic (Exercise 9,， See also the solution of the 
pendulum problem in Fig.I.1.4). 


It is natural to search for numerical methods that produce a reversible numerical 
fow when they are applied to a reversible differential equation. We then expect the 
numerical solution to have long-time behaviour Similar to that of the exact Solutioni 
See Chap. XI for more precise Statements. It turns out that the p-reversibility of a 
numerical one-step method is closely related to the concept of symmetry. 

Thus the method is theoretically symzzzetrical or reversible, aterminology 


we have never Seen applied elsewhere. 
(PC. Hammer & JW. Hollingsworth 1959) 


Definition 1.4. A_ numerical one-step method 1 is called sy7az1tetric OF 1211e- 
1eversiple,L ifit satisfies 


ho2 三 这 or equivalently 2) 一 -1 


一 彤 ” 
1 The study of symmetric methods has its origin in the development of extrapolation meth- 
ods (Gragg 1965, Stetter 1973), because the global error admits an asymptotic expansion 


in even powers of 几 . The notion of time-reversible methods is more common in the Com- 
putational Physics literature (Buneman 1907). 
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With the Definition I.3.1 of the adjoint method (ie., 7 王 G- 1)， the condition 
for symmetry reads G1 = 5. Amethod Wi = 2(Vyo) is symmetric 让 exchanging 
2o 作 Vand 玉 人 一 六 leaves the method unaltered. In Chap.I we have already en- 
countered the implicit midpoint rule (1L.1.7) and the Stormer-Verlet Scheme (1.1.17)， 
both of which are Symmetric. Many more Symmetric methods will be given in the 
following sections. 


Theorem 1.S. 矿 a 1U11erical 1711etjpod apPplied 1o qd D-7eversiple djrerential equa- 
101， SG1isjiey 
Do 思 王 2_no0D， (1.4) 


1je1 妨 e 11U1lefical Jow Oh 713 a D-7eve1Siple 11aD 矿 a11d o111y 矿 O1 13 Q Sy1I111etrrC 
111e11od. 


Prooj Ags a consequence of (1.4) the numerical fow 27 is p-Teversible 过 and only 
让 29_n op 王 Gj o 0. Since pls an invertible transformation, this is edquivalent to 
the Symmetry of the method 呈 7). 














Similarly, it ls also true that a Symmetric method is p-reversible 寺 and only 二 
the 0-compatibllity condition (1.4) holds. 

Compared to the Symmetry of the method, condition (1.4) is much less restric- 
tive. It is automatically Satisfied by most numerical methods. Let us briefy discuss 
the validity of (1.4) for different classes of methods. 


e@ 有 RU18e-KU1ta 1ietjpods (explicit or implicit) Satisfy (1.4) without any restriction 
other than (1.1) on the vector field (Stoffer 1988). Let us illustrate the proof with 
the explicit Euler method 环 ,(yo) = yo 十 刀 Foo): 


(oo 于 )(yo) = pyo 十 Ppjoo) = pyo 一 六 joyo) = -nyo). 


Partitio1ed Ru118ge-KUtta 11etjpods applied to apartitioned System (1.3) Satisfy the 
condition (1.4) 证 pf(w; v) = (pi(u),pa(w)) with invertible pl and pa. The proof is 
the same as for Runge-Kutta methods. Notice that the mapping p(u,u) 一 (了 一) 
of Example 1.3 is of this Special form. 

e@ Co11DoO5i1io7 11etpods. Itwo methods SG and 内， Satisfy (1.4), then So does the 
adjoint 2 and the composition Zn o 败 . Consequently, the composition methods 
(3.1) and (3.2) below，which compose a basic method 2j and its adjoint with 
different step Sizes, have the property (1.4) provided the basic method 2n has it. 
SPlitting netpods arebased on asplitting 0 = (人 力 二 FDI(y) ofthe differential 
equation. If both vector fields,， FDI(y) and jj(), satisfy (1.D, then their exact 
fows oh and o 问 Satisfy (1.2). In this Situation, the Splitting method (1H.3.6) has 
the property (1.4). 

e For di1jerenfial eqguatio1y om 711a11 加 lds we have to assume that p maps A4 to 
人 4. Otherwise, condition (1.1) does not make sense. For the projection method 
of Algorithm IV.4.2 with orthogonal projection onto the manifold we have: 直 
the basic method satisfies (1.4) and 过 p is an orthogonal matrix, then it Satisfies 
(1.4) as well. This follows from the fact that the tangent and normal Spaces Satisfy 
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ToyA = pyNM and NoyNM = pyAM,respectively. A similar result holds 
for methods based on local coordinates, if the local parametrization is well cho- 
Sen. For example, this is the case 让 pwV(z) is the parametrization at pyo whenever 
U(z) is the parametrization at V0. 


V.2 Symmetric Runge-Kutta Methods 


We give acharacterization of symmetric methods of Runge-Kutta type and mention 
Some important examples. 


V.2.1 Collocation and Runge-Kutta Methods 


Symmetric collocation methods are characterized by the Symmetry of the colloca- 
tion points with respect to the midpoint of the integration Step. 


Theorem 2.1. 77Pe aajoipt 1ietpod ora coliocafion 1ietpod (De1ipzifion 太 7.3) pased 
Oo11 Cl1,.. .cs13sacoliocation 111etpod pased o1l cT ,cs Wje1e 


沫 ee 


Ci; 1 一 Cs 十 1 一 1 (2.1) 


1 1jpe case 1paf ci 三 1 一 csrl ijoralz tipe colocatioz 11ethod 18 8y11111e11ic. 
7jpe ad1joipat 1ietpod of a discomti1zuols coliocatiom 111etpod (De1ijzitiomn 太 7.0) 


Dased om D1;,bs a11d co .cs 18adiscotintos collocatiom 17ethpod pased om 
上 ,0 14 co CT， We1e 
厅 一 bs， 了 多 =D 01d cc 三 1 一 cs (2.2) 


瑟 1e case 1atbl 三 bsad1zd ci 三 1 一 cs ijoral2 加 edisco1atiiuoxs co1locatiom 
111e11od 18 Sy111111e117C. 





0 C1 C2 C3 C4 C5 1 


Fig. 2.1. Symmetry of collocation methods 


Pioof Exchanging (to,yo) 他 (,VW) and 玉 人 一 inthe definition of a collo- 
cation method we get v( 刀 ) = (三 一 cip) 一 (三 一 ci 人 (1 一 ci 由))， and 
20 一 尺 ( 丰 一 内 . Inserting 扩 一 如 十 几 this yields the collocation method based 
on ci of (2.1). Observe that the c; can be arbitrarily permuted. For discontinuous 
collocation methods the proof is similar. 














The preceding theorem immediately yields the following result. 
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Corollary 2.2. 71Pe Carss jommtlasy (72ple 到 71. 册 as wellas 1Pe ZLopatto 70A4 (72- 
Pie 11L.71.2)a10d Lopatto 111B Joratlasy (Tapie 凡 L7. 有 aaje sy11111et1ric 101eg1G1o18. 














Theorem 2.3 (Stetter 1973，VWVanner 1973). 77jpe adajoint 1ietpod oF da1z s-8iage 
民 z1128e-KU1ia 111e1pod (77 有利 13 0a8a10 G1 5-8S1G8e RU118e-KU1G 111etjpod 11s coe 矿 - 
Cie11y G1e give Dy 


Qj bs+1-7 一 Qs 二 1 一 让 s 十 1 一 7 太 二 和 机 三 (2.3) 


矿 





Qs+1_is+l7 十 ai 三 0 Jora1ll2 7 (2.4) 


态 e1 态 e Rage 一 Rita 71aetpod (下 7. 乱 入 Sy1110etric.? 


Proof Exchanging yo 人 VL and 一 一 六 in the Runge-Kutta formulas yields 
应 一 让 (二 仿 一 oj 有 ) 四 = 加 + DC.5) 
5 一 1 i 一 1 


Since the values 271( 区 一 aij) 三 1 一 ciappear in reverse order, we Teplace ji by 
Ks+1_iin (2.3), and then we Substitute all indicesz and7 by s 十 1 一 zand s 十 1 一 7， 
TeSpectively. This proves (2.3)， 


The assumption (2.4) implies Qi 一 aij and 一 bi, So that G1 一 G1. 














Explicit Runge-Kutta methods cannot fulfill condition (2.4) with 7 = 7, and it is 
not diffcult to see that no explicit Runge-Kutta can be Symmetric ( Exercise 2). Let 
us therefore turn our attention to diago1maliy ;102PLiciL RU128e-KUita 1Ietjpods (DIRK)， 
for which ai = 0 for 7 < 7 but with diagonal elements that can be non-zero. 
thls case condition (2.4) becomes 


Qi 一 0 一 bs+1-7 for ? > 7， Qj7 十 Qs+1-7s+1-7 一 0 (2.0) 


The Runge-Kutta tableau of such a method is thus of the form (e.g.,fors 王 5) 


C1 Q11 
C2 Da22 
C3 pb 0 aa33 


1 一 cp pb pb ad44 人 


1 一 cl bp pb 加 oa55 
0 








With as3 = pa3/2, a44 = 0 一 ao2,anda55 一 0 一 all.Ione ofthe wu vanishes， 
then the corresponding stage does not infuence the numerical result. This stage can 
therefore be Suppressed, so that the method is equivalent to one with fewer Stages. 
Our next result Shows that methods (2.7) can be interpreted as the composition of 
0-methods, which are deftined as 


2 For irreducible Runge-Kutta methods, the condition (2.4) is also necessary for symmetry 
(after a suitable permutation of the stages). 
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2(yo)=，， where 页 =m+PFL-0y+b0) (02.8) 
Observe that the adjoint of the 0-method is 0 一 一 


Theorem 2.4. 4 diagoma11y ziPLicit Ra128e-KU1ta 1Ietjpod SatlisAi7g 1pe Sy1z111e11y 
co1ditioz (2.4) a10 天 058eguivalent io acomposifionz or0-111etpods 


CE1 洲 CQL2 洲 2 CT 
2 0 2592 (2.9) 
Where ai 一 Qiz/0i 


Proof Since the 0-method is a Runge-Kutta method with tableau 


0 10 
1 


this follows from the discussion in Sect.IIL.1.3. We have used 5 全 写 人 
which holds, because bs+1 三 wandasfl ii 三 1 一 aiby(2.0). 














A more detailed discussion of Such methods is therefore postponed to Sect. V.3 
on Symmetric composition methods. 


V.2.2 Partitioned Runge- 下 utta Methods 


Applying partitioned Runge-Kutta methods (1.2.2) to general partitioned Systems 


= 7 2， 之 一 9(2 2)， (2.10) 


it is obvious that for their symmetry both Runge-Kutta methods have to be Symmet- 
iic (becausey = jy) andz = 9g(z) are Special cases of (2.10)). The proof of the 
following result is identical to that of Theorem 2.3 and therefore omitted. 


Theorem 2.S. 矿 1Pe coeicien1s of Dot Ru1z8ge-KU1G 1aetjhods Di) aij G11G 0 
Sa1isjy 1je co11di1io11 (2. 和 因 1 太 e1a 1je Patitio1led Ru118e-KU1ta 11etpod (11.2.2) 718 Sy111- 
111E177C. 














As a consequence of this theorem we obtain that the Lobatto IIA-IUB pair (See 
Sect. JI.2.2) and, in particular, the Stormer-Verlet Scheme are Symmetric integrators. 

An interesting feature of partitioned Runge-Kutta methods is the possibility of 
having expliczh sy1z111etric methods for problems of the form 


= /2)， 之 一 9(y). (2.11) 


Second order differential equations Y = 9(V), written in the formy = 2 之 三 9g( 
have this structure, and also all Hamiltonian Systems with separable Hamiltonian 
万 (pp,q) =T(D) 十 了 (9q).Itis not possible to get explicit symmetric integrators with 
non-partitioned Runge-Kutta methods (Exercise 2). 

The Stormer-Verlet method (Iable HU.2.1) applied to (2.11) reads 
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刀 
21/2 二 2Z0 十 训 9(Vo) 





V1 二 yo 十 屎 所 2172) 
几 
2 一 21/2 十 事 纺 2 


and is the composition 7 /29 21n/2, Where 





一 V0 十 风 
册 | 二 项 轴 | (2.12) 
2 20 2 一 20 十 Pg(Vo) 
is the Symplectic Euler method and 
一 V0 十 见 
3 一 呈 人 2/ 20 jz0) C.13) 
好 20 和 一 20 十 Ag(O) 





its adjoint. All these methods are obviously explicit. How can they be extended to 
higher order? The idea is to conslider partitioned Runge-Kutta methods based on 
diagonally implicit methods such as in (2.7). If ai 0ii 一 0, then one component 
of the ?th stage ls given explicitly and, due to the Special Structure of (2.11), the 
other component is also obtained in a straightforward manner. In order to achieve 
ai 0ii 一 0 with asymmetric partitioned method,， we have to assume that s, the 
number of stages, js even. 


Theorem 2.6. 4 Partifioned RU128e 一 KG 711e 纺 0@ Dased om foO dia8O7G17 PHic 开 
1aetjpods Satispjipg ai 0 一 0 and(2.4 Wi 太太 天 0a1nd 访 天 0, 718 equialerat 
10O 4 co11poSifio1 Of Go ad 号 六 Yi 2 ad 2 8iye Dy (2.12) 01d (2.73 
1eSDectivey 














For example, the partitioned method 





0 信 

六 D 0 
 2 0 加 
Dj 四 站 2 2 0 
四 有 和 D 轴 


Satisfies the asSumptions of the preceding theorem. Since the Inethods have identical 
Stages， the numerical result only depends On 全 ， b1 十 b2， 也 十 1 pb3 十 04，and 
1 Therefore，we can assume that 已 一 一 ww and the method is equivalent to the 
composition 2 o bn o 5 o bi 


V.3 Symmetric Composition Methods 


In Sect. JI.4 the idea of composition methods is introduced, and a Systematic Way 
of obtaining high-order methods is outlined. These methods, based on (1.4.4) or on 
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(1U.4.3),turn out to be symmetric, but they require too many Stages. A theory of order 
conditions for general composition methods ls developed in Sect. HI.3. Here，we 
apply this theory to the construction of high-order Symmetric methods. We mainly 
follow two lines. 


@ ,9y111111E117iC CO11DOSI1IO1 OF Jst order 711etpods. 
玩 三 BuspoGpgho.o55jo Gaiio 9 (3.1) 


where Oh is an arbitrary first order method. In order to make this method Sym- 
metric, we assume as = 0 as。 1 = Do etc. 
@ ,9y111111E117iC CO11DOSI1IO1 OF sy11111et1ric 11et1jodsy. 


三 Cnog no.oDnoCnhh (3.2) 


where On is asymmetric Second order method and ys。 三 说 ,Ts -1 三 2 etc. 


V.3.1 Symmetric Composition of First Order Methods 


Because of Lemma 3.2 below, every method (3.2) is a Special case of method (3.1). 
In this subsection we concentrate on methods that are of the form (3.1) but not of 
the form (3.2). 

For constructing methods (3.1) of a certain order, one has to Solve the System 
of nonlinear equations given in Theorem IIU.3.14 (see also Example II.3.15). The 
Symmetry asSumption on the coefficients considerably simplifies this System. 


Theorem 3.1.， 矿 1je coe 太 cients of aetjpod (3.7) Satisjp as 三 永 jora117 tpe7 
这 1 Scient to comsider 加 ose treesy wiip odd | 了 ||. 


Prooj This is a consedquence of Theorem JI.3.2 (the maximal order of symmetric 
methods is even). In fact 过 the condition for order 1 is satisfied, it is automatically 
of order 2. I 革 in addition, the conditions for order 3 are Satisfied, it is automatically 
of order 4, etc. 














It may come as a Surprise that the popular leapfrog ... can be beaten, but 
only slightly. (R.IL McLachlan 1995) 


Methods of Order 2. The only remaining condition for order two is > _1(ak 十 
有 大) = 1 and,fors= 王 1,thesymmetry requirement leads to Ghy2oG7 /2 Depending 
on the choice of wj, this method is edquivalent to the midpoint rule, the trapezoidal 
rule, or the Stormer-VYerlet Scheme, all very famous and frequently used. However， 
McLachlan (1995) discovered that the case 5 一 2 can be Slightly more advanta- 
geous. We obtain 

un O Sa7j2 on O 2G72 一 o)P O < (3.3) 
Where a ls a free parameter, which can Serve for clever tuning. 


Minimizing the Local Error of Composition Methods. Subtracting the 已 ~- 
Series of the numerical and the exact Solutions (See Sect. HI.3.2), we obtain 
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Fig. 3.1. The error functions |ai(a)| deftined in (3.5) (left picture). Work-precision diagrams 
for the Kepler problem (as in Fig.I.4.4) and for method (3.3) with w 三 0.25 (Stormer 一 
Verleb, a = 0.1932 (McLachlan), and a = 0.22.“IE” method On treats position by implicit 
Euler velocity by explicit Euler;“EI”: method Sn treats position by explicit Euler velocity 
by implicit Euler 


1 
(中 ) 人 





用 乓 一 和 (的 = om-eO) FODW+OO2) 


lrlEpH1 


Assuming that the basic method has an expansion @Gjp(y) 一 十 刀 Fo) 十 Pda(y) 十 
hsds(y) 十 . ,weobtain for method (3.3), similar to (IIL3.3), the local error 


肛 (@(aJ@( 殷 二 ee(a)( 的 用 (的 +@(a) 人 (Pa 人) 


aa(PAD)OTe(UPhGTo0O 
Which contains one term for each of the S trees7 E Ts with ||r|| = 3. The qi(o) 
are the polynomials 
wa) = 了 (1 一 6 十 12o2)， 5 甩 而 兰 了 (-1+6a 一 8o2)， 二 
oo=-o ， oo=i-6a+6o)， oO=jao， 


which are plotted in the left picture of Fig.3.1. 玉 we allow arbitrary basic methods 
and arbitrary problems, all elementary differentials in the local error are indepen- 
dent, and there is no overall optimal value for aQw. We see that the modulus of qi(o) 
and qz(a) are minimal for a = 1/4, which is precisely the value corresponding to a 
double application of 多 py2 9 于 /2 With halved step size. But the values |qa(oo)| and 
da4(a)| become smaller with decreasing a (close to a = 1/4). McLachlan (1995) 
therefore minimizes Some norm of the error (See Exercise 4) and arrives at the value 
Q 三 0.1932. 

In the numerical experiment of Fig.3.1 we apply method (3.3) with three differ- 
ent values of a' to the Kepler problem (with data as in Fig.I.4.4 and the symplectic 
Euler method for 2n). Once we treat the position variable by the implicit Euler 
method and the velocity variable by the explicit Euler method (central Picture), and 


152 V. Symmetric Integration and Reversibility 


once the other way round (Tight picture). We notice that the method whphich is best in 
one case ls worst in the other 

This Simple experiment Shows that choosing the free parameters of the method 
by minimizing Some arbitrary measure of the error coefficients is problematic. For 
higher order methods there are many more expressions in the dominating term of 
the local error (for example: 29 terms for ||r|| = 5). The corresponding functions 9i 
give alot of information on the local error and they indicate the region of parame- 
ters that produce good methods. But, unless more information ls known about the 
problem (Second order differential equation, nearly integrable Systems), one usually 
minimizes, for orders of 8 or 10, just the maximal values of the ai, 必 , or 飞 (Kahan 
久 Li 1997). 


Methods of Order 4. Theorem 3.1 and Example IIU.3.15 give 3 conditions for 
order 4. Therefore, we puts 三 3in(3.1) and assume Symmetry DO1 三 Qa3, Da 一 Q2， 
and 03 = al1. This leads to the conditions 


工 
ai+ooTos= 王 = (二 -agoa+oo)=0 


Since with al 十 az =0or with al 十 as =0thefirsttwo ofthese equations are not 
compatible, the unique solution of this System is 


1 21/3 


273 72 二 279 


We observe that 让 = ai for all ?. Therefore, Bo 26， can be grouped together in 
(3.1) and we have obtained a method of type (3.2), which is actually method (IL.4.4) 
with p = 2. 

Again, the Solutions with the minimal number of stages do not give the best 
methods (remember the good performance of Suzuki's fourth order method (1.4.3) 
in Fig.JI.4.4), So we look for 4th order methods with larger s. McLachlan (1995) 
has constructed a method for s = 5 with particularly small error terms and nice 
coeffcients 














4-v 硬 146+5Vv 西 
DO = a5 1 ad 三/ 访 5 (3.0) 
六 ww 二 -2 一 20v 古 1 
过 4 270 ) 4 135 》 对 和 : 


Which he recommends "for all uses”“. 

In Fig.3.2 we compare the numerical performances of all these methods on oOUT 
already well-known example in both variants (implicit-explicit and Vice-versa). We 
See that the best methods in ojze Picture may be worse in the other. For comparison， 
the results are SUrrounded by “ghosts in grey” Tepresenting good formulae from the 
next ]ower (order 2) and the next higher (order 6) class of methods. 


Methods Tuned for Special Problems. In the case where one is applying a specia/ 
method to a special! problem (e.g., to Second order differential equations or to Small 
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Order 4 Order 4 























Fig. 3.2. Work-precision diagrams for methods of order 4 as in Fig.3.1;“3j: the Triple Jump 
(1L.4.4); “su” method (IIL.4.5) of Suzuki; “ml McLachlan (3.60);“bm” method (3.7); in 
grey: neighbouring order methods StormerVerlet (order 2) and p6 s9 (order 6) 


perturbations of integrable Systems), more Spectacular gains of efficiency are pos- 
Sible. For example, Blanes 儿 Moan (2002) have constructed the following fourth 
order method with s 一 6 


01 = a6 =0.082984406417405， al = 06 = 0.16231455076687， 
[02 = a5 = 0.23399525073150， a2 = [5 三 0.37087741497958， (3.7) 
[03 = a4 = 一 0.40993371990193， a3 一 04 三 0.059762097006575 ， 


which,， when correctly applied to second order differential equations (Tight picture 
of Fig.3.2) exhibits excellent performance. 

Further methods, adapted to the integration of Second order differential equa- 
tions, have been constructed by Forest (1992), McLachlan 儿 Atela (1992), Calvo 
人 久 Sanz-Serna (1993), Okunbor 多 Skeel (1994), and McLachlan (1995). Another 
important Situation，which allows a tuning of the parameters,， are near-integrable 
Systems Such as the perturbed two-body motion (e.g., the outer Solar System consid- 
ered in Chap.D. If the differential equation can be split into 六 = FDI(w) 二 FDI(y)， 
where 1 = /HH(W) is exactly integrable and JfD](o) is small compared to FDI(y)， 
Special integrators Should be used. We refer to 人 inoshita, Yoshida 儿 Nakai (1991)， 
Wisdom & Holman (1991), Saha & Tremaine (1992), and McLachlan (1993b) for 
more details and for the parameters of such integrators. 


Methods of Order 6. By Theorem 3.1 and Example II.3.12 a method (3.1) has to 
Satisfy 9 conditions for order 6. It turns out that these order conditions have already 
asolution with s 王 7, but all known solutions with s 和 8 are equivalent to methods 
of type (3.2). With order 6 we are apparently close to the point where the enormous 
Simplifications ofthe order conditions due to Theorem 3.3 below start to outperform 
the freedom of choosing different values for oa; and .We therefore continue oUT 
discussion by considering only the Special case (3.2). 
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V.3.2 Symmetric Composition of Symmetric Methods 


The introduction ofmore symmetries into the method Simplifties considerably the or- 
der conditions. These simplifications can be bestunderstood with a Sort of "Choleski 
decomposition ”of symmetric methods (Murua 久 Sanz-9erna 1999). 


Lemma 3.2. Fopr eyery sy11a11letric 111etPod Gp (Oo) 蕊 at aa1aits a1a expa118io7 六 Pow- 
e18 ok 几 妨 eye exists Gp(V) SUCP 芒 at 


环 ( 人 = (Gojao 人 (人 


Pioof Since 21(V) = 十 OP is close to the identity, the existence of a unique 
method B (y) = 十 Pai( 轨 十 Pdo( 人 十 .satisfying 7 一 G1a o G1 2 follows 
from Taylor expansion and from a comparison of like powers of 几 . 

ICn(y is symmetric, we have in addition 


二 于 | 
2 有 二, 业 寺 2_7112 岂 2 呈 _7172， 











and /2 三 G-1 /2 二 Gy 12 follows from the uniqueness of 瑟 . 





We let 2 be asSymmetric method, and we consider the composition 
且 三 CooDopnoCn: (3.8) 


Using the method 外 of Lemma 3.2, this composition method is equivalent to (3.1) 
(2 replaced with 2 ) with 


o = 应 三 2 G.9) 


Theorem 3.3， For co1DPositio1m 1Ietpods (3.8) Wi 如 Sy1a11etric 下 1 SUHciet 1o 
CoO11Sider 1je ojder co11difio11y of 7T1eoe1 11374Jor7EHwjpere ayverticesy or7 
Pave odd idices. 


Pioof Ifi(r) is even, it follows fom oz = DB and from (II.3.11) that 





as(T) = as_1(7T) = .=al(7)=ao(7T) = 0. 





Since e(7) = 0 for such trees, the corresponding order condition is automatically 
Satisfied. Any other vertex with an even index can be brought to the root by applying 
the Switching Lemma II.3.8. 














After this reduction,，only 7 conditions SUrvive for order 6 from the trees dis- 
played in Example HI.3.12. A further reduction in the number of order conditions is 
achieved by asSsuming sy1z11let1ric coejpcie1ts in method (3.8), 1.e.， 


?Ts+1-7 二 2 for all (3.10) 


This implies that the overall method 内 ls Symmetric, So that the order conditions 
for trees with an even ||r|| need not be considered. This proves the following result. 
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2D=2 |2=4 2=6 


aleosey 


D 王 1 


pw 电光 


Fig. 3.3. Symmetric Composition of Symmetric Methods up to order 10 























Theorem 3.4， For co11zPpositio1 11etpods (3.8) wii Sy11111etric Gj Sa1isAi1g (3 了 .10 
矿 13 SUjpcie11t 1o co1lsider 1je o1der co1ditio1ly Jor7 E 人 Where al verticey or7 
jave odd ipndices and wpere | 7 zs odQ. 














Figure 3.3 Shows the remaining order conditions for methods up to order 10. We 
See that for order 6 there remain only 4 conditions, much less than the 166 that we 
Started with (Theorem III.3.0). 


了 xample 3.S.， The rule of II.3.14) leads to the following conditions for sy7z711zetric 
composition of sy7az7111etric methods: 





Order 2: 人 >》， 庆 三 工 
K=1 





Order4:  G@) >》 说 二 0 
K=1 








/ 2 
Order0:  @ >》， 斋 =0 这 (到 7 一 0 
有志 RS 
S S 达 2 
Order 8:  @O) >》， ?7 三 0 办 全 (二 7 三 0 
太 二 站 太 三 下 庆 
DOG) S 上 ， & / (DCOD S & 4 
六 误 ( >? = 0 
K=1 =1 =1 忆 KE 一 1 = 








Here, Similar to Example JI.3.13, a_Przmae attached to a Summation Symbol indi- 
cates that the last term yz is taken as ?77/2. 


Methods of Order 4. The methods (1L.4.4) and (II.4.5) are both of the form (3.8)， 
and those with p = 2 yield methods of order 4. We have seen in the experiment 
of Fig.I.4.4 that the method (1L.4.3) yields more precise approximations; See also 
Fig.3.2. We do not know of any 4th order method of type (3.2) that ls Significantly 
better than method (3.1) with coefficients (3.6). 
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Methods of Order 6. HI we search for a minimal stage Solution of the four 
equations for order 6, we apparently need four free parameters 71,72,73,74; then 
?5;,36,77 are determined by symmetry. Equation @ gives 4 三 工 一 2(71 十 2 十 33). 
So we end up with three equations for the three unknowns 71,72,73. A numerical 
Search for this problem produces three Solutions, the best of which has been discov- 
ered by many authors, in particular by Yoshida (1990), and is as follows: 


妨 三 办 二 0.78451361047755726381949763 ”26 57 
22 三 6 三 0.23557321335935813368479318 | 
?3 三 人 三 一 1.17767998417887100694641568 

4 三 1.31518632068391121888424973 1 


Using computer algebra，Koselefft (1996) proves that the nonlinear System for 
?71;7?72,73 has not more than three real Solutions. 

Similar to the Situation for order 4, where relaxing the minimal number of Stages 
allowed a significant increase of performance, we also might expect to obtain better 
methods of order 6 in this way. McLachlan (1993) increases s by two and constructs 
good methods with small error coefficients. By minimizing maxi |%|, Kahan 多 Li 
(1997) obtain the following excellent method 3 


(3.11) 





?1 三 To 0.39216144400731413927925056 D6 s9 

22 三 外 三 0.33259913678935943859974864 
?73 三 和 三 一 0.70624617255763935980996482 
4 三 6 三 0.08221359629355080023149045 | 
5 三 0.79854399093482996339895035 1 


This method produces，with a comparable number of total steps, errors which are 
typically smaller than those of method (3.11). Numerical results of these two meth- 
ods are given in Fig. 3.4. 


(3.12) 








Order 6 Order 6 




















Fig. 3.4. Work-precision diagrams for methods of order 6 for the Kepler problem as in 
Fig.3.1;“7”: method p6 s7 of (3.11); “9”: method p6 s9 of (3.12); in grey: neighbouring 
order methods (3.6) (order 4) and p8 s17 (order 8) 


3 The authors are grateful to S. Blanes for this reference. 
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Methods of Order 8. For order 8, Fig. 3.3 represents 8 equations to Solve. This in- 
dicates that the minimal value of sls 15. Anumerical search for solutions 7 ，. .78 
of these equations produces hundreds of solutions. We choose among all these the 
solution with the smallest max(|y). The coefficients, which were originally given 
by Suzuki & Umeno (1993), Suzuki (1994), and later by McLachlan (1993), are as 
follows: 


1 三 5 三 0.74167036435061295344822780 
?72 三 ?4 三 一 0.40910082580003159399730010 D8515 
?73 三 ?Ti3 三 0.19075471029623837995387626 
?4 三 Ti2 三 一 0.57386247111608226665638773 
= 三 ?1 三 0.29906418130365592384446354 
?6 三 ?io 三 0.33462491824529818378495798 
07 三 ?9 三 0.31529309239676659663205666 
?8 = 三 一 0.79688793935291635401978884 


By putting s = 17 we obtain one degree of freedom in Solving the equations, This 
allows an Improvement on the foregoing method. The best known solution, slightly 
better than a method of McLachlan (1995), has been found py Kahan 儿 LI (1997) 


| .13) 


0.13020248308889008087881763 





二 和 二 
72 三 ?6 三 0.56116298177510838456196441 
轨 三 mi5 = 一 0.38947496264484728640807860 ”2D8 5s17 
?74 三 ?4 三 0.15884190655515560089621075 | 
35 三 ?13 三 一 0.39590389413323757733623154 | (3.14) 
?6 三 i2 三 0.18453964097831570709183254 1 
27= 一 TI 三 0.25837438768632204729397911 0 1 
?8 = 三 ?io 三 0.29501172360931029887096624 
?9 = 三 一 0.60550853383003451169892108 


Numerical results, in the Same style as above, are given in Fig.3.5. 





Order 8 Order 8 




















Fig. 3.$. Work-precision diagrams for methods of order 8 for the Kepler problem as in 
Fig.3.1;“15” method p8 s15 of (3.13);“17”: method p8 s17 of (3.14); in grey: neighbouring 
order methods 26 s9 (order 6) and p10 s35 (order 10) 


158 V. Symmetric Integration and Reversibility 


Methods of Order 10. The first methods of order 10 were given by Kahan & Li 
(1997) with s = 31 and s = 33, which could be improved on after Some nights of 
computer Search (See method (V.3.15) of the first edition). A Significantly Improved 
method for s = 35 (See Fig.3.5 for a comparison with eighth order methods) has in 
the meantime been found by Sofroniou 儿 Spaletta (2004): 





7 三 ?35 三 0.07879572252168641926390768 
?2 三 ?34 三 0.31309610341510852776481247 
?73 三 733 三 0.02791838323507806610952027 
?4 = ?3a2 = 一 0.22959284159390709415121340 
三 ?al 三 0.13096206107716486317465686 
?6 三 ?30 = 一 0.26973340565451071434460973 ”2p10 535 
?7 三 ?9 三 0.07497334315589143566613711 | 
?8 三 ?sg 三 0.11199342399981020488957508 | 
?9 三 ?7 三 0.36613344954622675119314812 | 0G3.15) 
?10 三 ?726 三 一 0.39910563013603589787862981 | ， 
9311 三 725 三 0.10308739852747107731580277 | 
?712 三 724 三 0.41143087395589023782070412 
?13 三 ?23 三 一 0.00486636058313526176219566 
?414 三 ?22 = 一 0.39203335370863990644808194 
015 二 ?21 三 0.05194250296244964703718290 
?16 三 ?720 三 ”0.05066509075992449633587434 
?217 三 Tio9 三 0.04967437063972987905456880 





?18 三 ”0.04931773575959453791768001 


V.3.3 Effective Order and Processing Methods 


There has recently been a revival of interest in the concept of “effective 
Order . (J.C. Butcher 1998) 


The concept of effective order was introduced by Butcher (1969) with the aim of 
constructing 5th order explicit Runge-Kutta methods with 5 stages. The idea ls to 
Search for a computationally effticient method 及 Such that with a suitable Xp， 


7 一 XPnoRKnoxrl (3.10) 


has an order higher than that of 人 .The method Knis called the Ke7mel, and Xn can 
be interpreted as a transformation in the phase Space, close to the identity. Because 
of 

殉 一 Xpo 天 Re， 


an implementation of 必 , over N steps with constant Step Size 几 has the Same com- 
putational efficiency as 及] The computation of 2 has only to be done once at the 
beginning of the integration, and Xn has to be evaluated only at output points, which 
can be performed on another processor. In the article LOpez-Marcos, Sanz-Serna 久 
Skeel (1996) the notion of preprocessing for the step and pos1D1ocess1118 for 
Xp ls introduced. 
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了 上 xample 3.6 (Stormer-Verlet as Processed Symplectic Euler Method). Con- 
Slider a Split differential edquation, let GZ 三 2 O o 电 be the Lie-Trotter formula 


or Symplectic Euler method (See Sect. II.5), and 6 三 的 O o 问 O 史 the Strang 
Splitting or Stormer-Verlet Scheme. As a consequence of the group property of the 
exact flow, we have 


G[ 兰 5 O GZ7] O 网 二 一 Xj o GZ O X7 


with X7 三 Hence, applying the Lie-Trotter formula with processing yields a 


Second order approximation. 


Since the use of geometric integrators requires constant Step Sizes, it ls quite 
natural that Butcher s idea of effective order has been revived in this context. A SYS- 
tematic Search for processed composition methods started with the works of Wis- 
dom, Holman & Touma (1996), McLachlan (1996), and Blanes, Casas 色 Ros (1999， 
2000b). 

Let us explain the technique of processing in the Situation where the kernel 开 
is a Symmetric composition 


一 Co ooDnoDn (37s+1 一 mi forall7) (3.17) 
of a Symmetric method 2jm. We Suppose that the processor ls of the form 
2 二 22.9o925po257， (3.18) 
Such that its inverse ls given by (use the Symmetry 瑟 ) 一 2 71) 
XTL 一 GopnoG ipnooGE 5 (3.19) 


Order Conditions. The composite method 败 = Xn o 开关 o 二 is of the form 
胁 = oogenocpen with 


(sa2r+s，.. .22,51) 一 (0 .0 本 .00 办 (3.20) 


Theorem 3.3 thus tells us that only the order conditions corresponding to 7 E 刀 ， 
Whose vertices have odd indices, have to be consldered. Unfortunately, the sequence 
{si} of (3.20) does not satisfy the symmetry relation (3.10), unless all 6; vanish. 
However 让 we require 


X_Ah(y) = Xp 人) 十 O(RP+ )， (3.21) 


we see that XI(O) = 人 () 十 O(PP+l, and the method 萝 = XoRKnoxXTl 

is symmetric up to terms of order O(hz+1). Consequently, the reduction of Theo- 

rem 3.4 is vallid, so that for order p only the trees of Fig. 3.3 have to be considered. 
For the first tree of Example 3.3 the order condition is 


27 十 S S 


1 三 >》， <K 一 >》 7 
开 一 | 天 一 | 
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and we See that this is a condition on the kernel 有 only. Similarly, for odd 7? we 
have 


27 十 S 
0= 2 所 一 2 过 (3.22) 
So that also the trees G, @, @, .. .give conditions on 人 nn and cannot be influenced 


by the procesSof. We next consider the trees of Example 3.3 with three vertices， 
whose order condition 1s 


We split the sums according to the partitioning into 0 mi 一 0i 让 (3.20), and we 
denote the expressions appearing in Example 3.5 by aw(7) and those corresponding 
to Xn and Xi by b(7) and 呈 -1(7), respectively. Using the abbreviations mi for the 
tree with one vertex labelled ?, Tiy for the tree with two vertices labelled ; (the rootb) 
andJ,andby Ti7o the trees with three vertices labelled ; (roob,7 and d (vertices that 
are directly connected to the root), this yields 


0 = (Tijg) 二 am) (三 )2 (二 ar)t (Ta) 

二 al(Tig)oL(T) 二 alTijo) 十 TD)OI(T)o (To) (3.23) 
十 bm)OI(m)a(m) 二 MP)a(m)boL (ro) 十 b()a(m)a(7o) 

二 8) (mo) 十 Mri)a(ra) 十 Wio)o (TD) 十 WTia)a() 十 DTizg) 


How can we Simplify this long expression7 First of all， we imagine 人 nto be the 
identity (either s 一 0orall 六 三 0),sothat 内 三 Xno X becomes the identity. In 
this situation, the terms involving w(7) are not present in (3.23), and we obtain 





0 一 和 (ao 十 WO (TD)D ra) 十 Mr) (ro 十 以 Ta)o (77) 十 BT 


We can thus remove all terms in (3.23) that do not contain a factor a(7). Now ob- 
serve that by (3.21), xn(y) as well as X7 (yy) have an expansion in even powers of 
几 . Therefore, pg(7) and -I(r) vanish for all 7 with odd |r|. Formula (3.23) thus 
Simplifies considerably and yields 





0 = a(7s11) 十 22(731)a(7T1)， (3.24) 
0 = a(7511) 十 20(751)a(71)， (3.25) 
0 = a(7313) 十 (73l)a(7s) 十 733)a(7T). (3.20) 


AsSimilar computation for the last tree in Example 3.3 gives (in an obvious notation) 
0 一 a(731l111) 十 40(7Ts1)a(T1)3 十 40(73111)Q(7T1)， (3.27) 


Since a(m) = > ;1 Wi 三 1,the conditions (3.24), (3.25) and (3.27) can be inter- 
Preted as conditions on the processor, namely on (731)，b(751) and (73111). We 


V.4 Symmetric Methods on Manifolds 101 


already have auw(73) = 0 from (3.22), and an application of the Switching Lemma 
IIL3.8 gives (733) = 去 (2(73)2 一 (76)). The term (73) vanishes by (3.21) and 
b(76) = 0 is a consequence of the proof of Theorem 3.3. Therefore (3.26) is equjv- 


alent to w(7313) = 0. We Summarize our computation in the following theorem. 


Theorem 3.7. 77Pe Pocessing 1ietjod 办 = XPo 玫 no 汉王 1 oforderDp(D 近 8) 矿 


e@ 1je coe 太 icien1s i oripe KemelsatisA te co1a1ditio11s or1pe Lefi colu11110 z0 已 KG11I- 
Pie 3.9,1e., 3 co1difio1y Jor order 6, da1d 5 co1mditio11y Jor order 8; 

e@ 1jpe coejjicien1sy 0i or 1Pe pyocessor da1e SUCcH 1Pat (3.21) jpo1ds (4 comdiftions 加 r 
ojder 6, G1d 8 co11ditio1y Jor ojder 8) ad maddition coadition (3.24) Jor ojder 
6, a1d (3.2 和 了 (3.23) (3.27) Jororder 8 are yatisjieda. 














Remark 3.8. Although we have presented the computations only for p 和 8, the 
result is general. All trees 7 E 人 ,which are not of the form 了 三 人 o GD，give 
Tise to conditions on the kernel 人 (for a Similar result in the context of Runge 一 
Kutta methods see Butcher 包 Sanz-9erna (1990)). The remaining conditions have 
to be Satisfied by the coefficients of the processor. Due to the reduced number of 
order conditions, it is relatively easy to construct high order kernels. However the 
difficulty in constructing a Suitable procesSor increases rapidly with the order. 


The application of the procesSsing technique is two-fold. A first possibility ls 
to take one of the high-order composition methods of the form (3.2), e.g., one of 
those presented in Sect.V.3.2, and to exploit the freedom in the coefficients of the 
processor to make the error constants Smaller. 

Another possibility ls to start from the beginning and to construct a method 开 记 
with coeffticients Satisfying only the conditions of Theorem 3.7. Methods of effective 
order 6 and 8 have been constructed in thlis way by Blanes (2001). 


V.4 Symmetric Methods on Manifolds 


Numerical methods for differential equations on manifolds have been introduced 
in Sections IV.4 and IV.3. The presented algorithms are in general not Symmetric. 
We discuss here suitable Symmetric modifications which often have an Improved 
long-time behaviour. We consider a differential edquation 


= 帮 o)， jy) ETyA (4.1) 


on amanifold 人 4, and we assume that the manifold is either given as the zero Set of 
afunction 9g(y) or by means of a suitable parametrization y = P(2). 
V.4.1 Symmetric Projection 


Due to the projection at the end of an integration Step, the standard projection 
method (Algorithm IV.4.2) is not Symmetric (See Fig.IV.4.2). In order to make the 
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overall algorithm symmetric, one has to apply a kind of "inverse projection ”at the 
beginning of each integration Step. This idea has first been used by Ascher & Reich 
(1999) to enforce conservation of energy, and it has been applied in more general 
contexts by Hairer (2000). 


Algorithm 4.1 (Symmetric Projection Method). 4sszxrmre tat yn E 人 4. Opme step 
or hy op+ltsdejjed asjollows (see 8.471 71811 DiCctU1e 站 


e 态 王 克 十 GUO)71 wjpereg( 加 ) =0(pertarpation step 
e 人 态 1 三 2( 砚 ) (sy1a11ietric ole-step Jetpod applied io = 大) 六 
e 0 一 防 和 1 十 GO wmw 太 Asucptpafg(nti) 三 0(projectiom ste 有 ) 





Here, G(V) = 9(W) denotes the Jacobian of 9g(y). It is important to take a 
Symmetric method in the Second step, and the same vector /in the perturbation and 
projection StepS. 


AM AM 
一 六 此 加 2 
2V0 


Fig. 4.1. Standard projection (left picture) compared to Symmetric projection (righb) 


上 xistence of the Numerical Solution. Thphe vector /4 and the numerical approxima- 
tion yl are Implicitly defined by 


人 本 n 7 @ 亿 信 C@ 刀 工 
2 ( + 一 于 (二 GO) 一 G(nl) 国 下 
9(yn1) 
Since 天 (0,yn,0) = 0 and since 
OP 了 | 
一 一 (00) = 4.3 
二 (ce 0 0 


is invertible (provided that G(y) has full rank), an application of the implicit func- 
tion theorem proves the existence of the numerical solution for suffticiently Small 
Step Size 几 . The Simple Structure of the matrix (4.3) can also be exploited for an 
effhcient Solution of the nonlinear System (4.2) using Simplifed Newton iterations. 
下 the basic method 2jn is itself implicit, the nonlinear System (4.2) Should be solved 
in tandem with 思 +1 三 2(7). 


Order， For a study of the local error we let yn := V( 如 ) be a value on the exact 
Solution y(t of (4.1). 开 the basic method 盏 jis of order P, ie, 这 yt 如 十 万 一 
By(t)) = O(P2+l ,wehave 开 (io 0) =O(P2+T). Compared to (4.2) 
the implicit function theorem yields 


V.4 Symmetric Methods on Manifolds 103 


grH 一 Was=ONT ) and =OT)， 


This proves that the Symmetric projection method of Algorithm 4.1 has the Same 
order as the underlying one-step method 呈 7 


Symmetry of the Algorithm.。 Exchanging 玉 人 一 六 and yn 全 yn+Hliinthe Algo- 
Tithm 4.1 yields 


Zr 二 Un 二 1 十 G(n+1TD)7 AN 9(Vn+1) 一 0， 
Zr+l 和 瑞 _P( 加 )， 


or 三 久 1 十 GO) gm)=0. 








The auxiliary variables /LVn， and 思 +1 can be arbitrarily renamed. If we replace 
them with 一 /yn+li, and yn, Tespectively， we get the formulas of the original al- 
gorithm provided that the method 2 of the intermediate step ls Symmetric. This 
proves the Symmetry of the algorithm. 

Various modifcations of the perturbation and projection steps are posSible with- 
out destroying the Symmetry. For example, one can replace the arguments yn and 
nfliinG(y) with (加 十 yn+1)/2.Itmight be advantageous to use a constant direc- 
tion ie., 太一 名 十 4710 1 = 了 力 和 十 471 with aconstant matrix 4. In this 
case the matrix G(V)47 has to be invertible along the solution in order to guarantee 
the existence of the numerical Solution. 


Reversibility。 From Theorem 1.3 we know that symmetry alone does not imply 
the 0-Teversibility of the numerical How. The method must also Satisfy the compat- 
ibility condition (1.4). It is Straighttforward to check that this condition is Satisfied 
寺 the integrator OP of the intermediate step of Algorithm 4.1 satisfies (1.4) and, in 
addition， 

2G(O) = GUO (4.4) 
holds with some constant invertible matrix o. In many interesting Situations We 
have 9g(py) = -79g(y) with a suitable o, so that (4.4) follows by differentiation 
让 po = 7 Similarly, when a projection with constant direction y 一世 十 477 is 
applied, the matrix 4 has to satisfy 0 47 = 47c for a suitably chosen invertible 
matrix Oo (See the experiment of Example 4.4 below). 


了 xample 4.2， Let us consider the equations of motion of a riglid body as described 
in Example IV.1.7. They constitute a differential equation on the manifold 


A = {(o2 四 )| 妈 十 好 十 她 =] 


and it is p-reversible with respect to _p(Wiyo,y3) = (一 Wo,ya), and also with 
Tespect to PUT ya 03) 三 (1 一刀;, 轨 ) and py 0) = (2 一 中) For a 
numerical Simnulation we take 六 2，712 1，713 2/3, and the initial value 
2%o = (cos(0.9),0,sin(0.9)). We apply the trapezoidal rule (II.1.2) with the large 
Step Size 玉 三 1inthree different versions. 

The upper Picture of Fig.4.2 shows the result of a direct application of the trape- 
Zoidal rule. The numerical solution lies apparently on a closed curve, but it does not 
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Fig. 4.2. Numerical Simulation of the riglid body equations. The three pictures correspond 
to a direct application (upper), to the standard projection (lower lefb, and to the symmetric 
projection (lower righb of the trapezoidal rule; 5000 steps with step Size 尺 三 工 


lie exactly on the manifold A4. This can be Seen as follows: the trapezoidal rule G7 
is conjugate to the implicit midpoint rule 5 人 4 Via a half-step of the explicit Euler 
method Xny/2. In fact the relations 


5 二 X212 9 XP/2 and 人 二 Xhn/2 9 X7.12 
hold, so that 
及 一 X1j2ogB1 ooxXna and (本 )” 王 Xiao( 到 ) ooXpny2. 
Consequently, the trajectory of the trapezoidal rule is obtained from the trajectory 
of the midpoint rule by a simple change of coordinates. On the other hand, the 
numerical Solution of the midpoint rule lies exactly on a solution curve because 
conserves quadratic invariants (Theorem IV.2.1). 

Using standard orthogonal projection (Algorithm IV.4.2) we obviously obtain a 
numerical solution lying on the manifold A4. But as we can See from the lower left 
picture of Fig.4.2, it does not remain near a closed curve and converges to a fxed 
point. The lower right picture Shows that the use of the Symmetric orthogonal pro- 
jection (Algorithm 4.1) recovers the property of remaining near the closed Solution 
CUrve. 
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上 xample 4.3 (Numerical Experiment with Constant Direction of Projection). 
We consider the pendulum equation in Cartesian coordinates (See Example IV .4.1)， 


dl 三 D1， D1 三 一 01 和 X， 


7 ， (4.5) 
d2 三 D2， Da 三 一 上 一 92 入 


with 入 = (2 十 22 一 02)/(o 二 ad This is aproblem on the manifold 


人 = {(oopipa)| 虹 十 吧 =1 opl 十 0apa =0}. 


It is p-reversible with respect to 0(ql,q2,Dpl,po) = (ql,q2, 一 D1, 一 D2) and also with 
respect to pq1, 9g2,D1,D2) 一 (一 91 92,D1, 一 D2). 

We apply two kinds of Symmetric projection 
methods. First， we consider an orthogonal projec- 
tion onto A4 as in Algorithm 4.1. Second, we project 
parallel to coordinate axes. More Precisely，wWe fx 
the first components in position and velocity 让 the 
angle of the pendulum is close to 0 or T (vertical 
projection in the Picture to the right), and we fx the 
Second components 这 the angle is close to 士 T/2 
(horizontal projection). The regions where the di- 
rection of projection changes, are overlapping. 

We notice in Fig.4.3 that for the orthogonal projection method the energy erT- 
Ior Temains bounded, and this ls also true for integrations over much longer time 
intervals. This ls in agreement with the observation of Chap.I，where Symmetric 
methods Showed an excellent long-titme behaviour when applied to reversible dif- 
ferential edquations. 




















Fig. 4.3. Global error in the total energy for two different projection methods - orthogonal 
and coordinate projection - with the trapezoidal rule as basic integrator. Initial values for 
the position are (cos 0.8, 一 sin 0.8) (left picture) and (cos 0.8, sin 0.8) (right picture); zero 
initial values in the velocity; step Sizes are 下 = 0.1 (solid) and 下 = 0.05 (thin dashed) 


For the coordinate projection, however we observe a bounded energy error only 
for the initial value that is close to equilibrium (no change in the direction of the 
projection is necessary). As Soon as the direction has to be changed (right plicture 
of Fig.4.3) alinear drift in the energy error becomes Visible. Hence, care has to be 
taken with the choice of the projection. For an explanation of this phenomenon we 
refer to Chap.IX on backward error analysis and to Chap. XI on perturbation theory 
of reversible mappings. 
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Fig. 4.4. Global error in the total energy for a Symmetric projection method violating (1.4). 
Initial values for the position are (cos 0.8, 一 sin 0.8) and (0, 0) for the velocity; step sizes 
are 丈 = 0.1 (Solid) and 玉 = 0.05 (thin dashed) 


了 上 xample 4.4 (A Symmetric but Non-Reversible Projection Method)， We con- 
Sider the pendulum equation as in Example 4.3. This time, however，we apply a 
projection 砚 王 如 十 471 1 三 芒 和 十 47 with 


E 1 0 0 
4=(: 0 0 由 E 王 0.2. 


FEor s = 0this corresponds to the vertical projection used in Example 4.3. For 
E 尖 0thereis no matrix o such that p47 = 47c holds for one of the mappings 
D that make the problem p-reversible. Hence condition (1.4) ls violated, and the 
method is thus not p-Teversible. The initial values are chosen such that 9g'( 轨 47 is 
invertible and well-conditioned along the Solution. Although the projection direction 
need not be changed during the integration and the method is Symmetric, the long- 
time behaviour is disappointing as Shown in Fig.4.4. This experiment illustrates that 
condition (1.4) is also Important for a quajlitatively correct Simulation. 


V.4.2 Symmetric Methods Based on Local Coordinates 


Numerical methods for differential equations on manifolds that are based on local 
coordinates (Algorithm IV.3.3) are in general not Symmetric. For example, 让 we 
consider the parametrization (IV.3.8) with respect to the tangent Space at yo, the ad- 
joint method would be parametrized by the tangent Space at V1. We can circumyvent 
this difhiculty by the following algorithm (Hairer 2001). 


Algorithm 4.5 (Symmetric Local Coordinates Approach). 4ssumze 1aty E 人 4 
a1d 加 at Un 1 a Vocal para1aietrization of 人 Satishing Wa(0) = a (close 1o V 由 
O1e Step on FUVnH11 dejzed asjollows (See 8. 趟 9 


e Jijid zz (close 1o 0) sucp tpatwWa(2) 一 Wor 

e@ 2T1 一 Gp(20) (sy1a11letric oO11e-step 111etpod applied io (7V.. 1 
e UVm+l 一 Wo 2+1); 

@ cjpooseaw 态 1pe pa1a1aetrization Sucpn ipat 2zn 十 轧 +1 三 0. 





下 1 DiDortaat io 7e1aaj1 天 1 态 at 态 e pa1a11aetrization yy 一 Un 人 2) 好 六 8e1ieral cpna1ged 
17 eyery sfeD. 
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22 
Fig. 4.S. Symmetric use of local tangent space parametrization 


This algorithm is illustrated in Fig.4.3 for the tangent Space parametrization 
(V.3.8), given by 
al] =a+Q(o)z 二 9 (ao wa(， (4.0) 
where the columns of Q(a) form an orthogonal basis of Tu and the function 
Wwa(z) is such that Wu(z) < 人 .Itsatisfies wa(0) 一 0andw(0) = 0. 


了 xistence of the Numerical Solution. In Algorithm 4.3 the values w E 人 4 and zz 
are implicitly determined by 


(4.7) 


元 二 交 二 Ce = 0， 


Upa(2p) 耕 -2 mm 


and the numerical Solution ls then explicitly given by yn +1 三 Wu ( 环 ( 2)). For 
more clarity we also use here the notation W(z,a) = Va(2). 于 the parametrization 
U(z,aj is differentiable, we have 


下 27 0 
一 一 一 (0,0,) = 4.8 
克 可 (0 on) 人 要 强人 0 


Since W(z,a) E 人 for al z and w E 人 A, the derivative with respect to a lies 
in the tangent space. Assume now that the parametrization V(z, Q) is such that the 
testriction of (0,yn) onto the tangent space Ty Ais bijective. Then, the matrix 
(4.8) is invertible on 了 月 4 x TvA4 (ad denotes the dimension of the manifold). The 
implicit function theorem thus proves the existence of a numerical solution (2 al) 
close to (0,yn). In the case where Wual(z) is given by (4.6), the matrix 

OU 

页 (= 一 go (go ) 9 

is a projection onto the tangent space Ta 人 A4 and satisfies the above assumptions 
provided that 9g'(a) has full rank. 


一 工 


Order We let yn := (如 ) be avalue on the exact solution y(t of (4.1). Then we 
fx a Ee A4 as follows: wereplace theupperpart ofthe definition (4.7) of 已 (PP xn a) 
WwWith 2zn 十 Pt(z)， where 人 denotes the exact fow of the differential equation 
for z( 旭 equivalent to (4.1). The above considerations show that such an a exists; 
let us call it ao*. If Gj is of order p，we then have 灰 ( 妨 z( 加 ) ao) = O(P2+1). 
An application of the implicit function theorem thus gives 2 一 z( 刀 ) = O(Rh2+T)， 
implying 男 +1 一 z(t1) = O(P+l,andfinally alsoy li 一 vi) = O(AP+T). 
This proves order p for the method defined by Algorithm 4.5. 
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Symmetry，。，Exchanging 中 全 一 and yn 全 yn+lin Algorithm 4.5 yields 


Wa(zn) 王八 和 0 匣 行 二 _P( 2)， Vm 一 Wua( 21)， 2 十 2+1 二 0. 


I 开 we also exchange the auxiliary variables 2 and 轧 +1 and 直 weuse the symmetry 
of the basic method 2n，we regain the original formulas. This proves the Symmetry 
of the algorithm. Again various kinds of modifications are possible. For example， 
the condition 2 十 罗 +1 三 0 can be replaced with 2 十 入 +1 三 X(P 2 2T1) .下 
X( 一 PiUU) 三 X(Puu),the symmetry of Algorithm 4.5 is not destroyed. 


Reversibility。In general, we cannot expect the method of Algorithm 4.3 to Satisfy 
the 0-compatibllity condition (1.4), which is needed for p-reversibijlity. However, 下 
the parametrization is Such that 


pwWau(z) = Woa(az) for Some invertible e ， (4.9) 


we Shall Show that the compatibility condition (1.4) holds. We first prove that 
for a _p-reversible problem 7 = jy) the differential equation (TV.5.7)，written as 


冯 三 Eu(2), is a-Teversible in the Sense that aFu(2z) 三 一 Foa(a2z). This follows 
from pwWa(z) = Woa(az)a (which is seen by differentiation of (4.9)) and from 


JWoa(az)) = 一 DJ(Wa(z)), because 
We(z)Ea(2) = UVa(2) 二 Vpo(oz)oma(z) = 一 /poola 人 让， 


Ifthe basic method @j satisfies co 到 一 Goao whenappliedto= 瑟 (z) (e.g 
for all Runge-Kutta methods), the formulas of Algorithm 4.3 Satisfy 


0 ynm 二 DWo(zmn) 一 Woa(a2m)， CO2mn 二 1 二 GE_n(a2nh)， 
Wpa(a 思 二 1 一 DWa(2 上 1 二 0 Vn 十 1， 0O2m 十 027+1l = 0. 


This proves that, starting with pyn and a negative step Size 一 /, the Algorithm 4.5 
produces DVn+1，where yn+ll just the result obtained with initial value yand 
step Size 凡 . But this is nothing other than the p-compatibility condition (1.4) for 
Algorithm 4.5. 

In order to verify condition (4.9) for the tangent Space parametrization (4.0), we 
write it as Wu(2) = 十 和 十 NG), where 2 is an arbitrary element of the tan- 
gent Space Tu and V(2) is orthogonal to TuA4 such that Wa(2) E 人 .Since 
Ta = Ta 人 and since, for a _p satisfying pp7 = 了 the vector PN(2) is or- 
thogonal to Toa 人 4, we have pwWa(2) = Wpa(02G). This proves (4.9) for the tangent 
Space parametrization of a manifold. 


了 xample 4.6， We repeated the experiment of Example 4.2 with Algorithm IV.3.3， 
using tangent Space parametrization and the trapezoidal rule as basic integrator, and 
compared it to the Symmetrized version of Algorithm 4.5. We were SUIprised to See 
that both algorithms worked equally well and gave anumerical solution lying near 
aclosed curve. An explanation is given in Exercise 11. There it is Shown that for the 
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Fig. 4.6. Numerical Simulation of the rigid body equations; standard use of tangent Space 
parametrization with the trapezoidal rule as basic method (left picture) and its symmetrized 
version (Tight picture); 5000 steps with step Size 玉 三 0.4 


Special Situation where 人 4 is a Sphere, the standard algorithm is also Symmetric for 
the trapezoidal rule. Let us therefore modify the problem slightly. 
We consider the rigid body equations (IV.1.4) as a differential equation on the 
manifold 
2 2 2 
AM = fo 区 | 皇 + 坚 十 纤 一 Const} (4.10) 
D 7DP 26 
with parameters and initial data as in Example 4.2, and we apply the standard and the 
Symmetrized method based on tangent Space parametrization. The result is Shown 
in Fig.4.6. In both cases the numerical Solution lies on the manifold (by definition 
of the method), but only the Symmetric method has a correct long-time behaviour. 


Symmetric Lie Group Methods， We turn our attention to particular problems 
Y=4(00) YY) = 苞 ， (4.11) 


where 4(Y) is in the Lie algebra g whenever Y is in the corresponding Lie group 
C. The exact solution then evolves on the manifold C. Munthe-Kaas methods 
(Sect.IV.8.2) are in general not Symmetric，even 这 the underlying Runge-Kutta 
method is Symmetric. This is due to the unsymmetric use of the local coordinates 
7 = exp(O)2. However, accidentally, the Lie group method based on the implicit 
midpoint rule 


了 1=exp(O) 际 ， 有 =j4(exp(2]/2) 环 ) (4.12) 


is Symmetric. This can be Seen as Usual by exchanging 尺 人 一 六 and 世人 六 1 
(and also 1] 全 一 人 2 for the auxiliary variable). Numerical computations with the 
Tigid body edquations (conslidered as a problem on the Sphere) Shows an excellent 
long-time behaviour for this method similar to that of the right picture in Fig.4.0. In 
contrast to the Implicit midpoint rule (I.1.7), the numerical solution of (4.12) does 
not lie exactly on the ellipsoid (4.10); See Exercise 12. 


170 V. Symmetric Integration and Reversibility 


For the construction of further Symmetric Lie group methods we can apply the 
ideas of Algorithm 4.3. As local parametrization we choose 


Wr(O) = exp(O)D (4.13) 


where UV = exp(9) 世 plays the role of the midpoint on the manifold. We put 2 = 
一 @ so that Vr(2p) = 了 .With this starting value 2 we apply any Symmetric 
Runge-Kutta method to the differential equation 


2=4 )+ 天 害 a (4(0uo(O))， 20=-6， (14) 
上 


(cf. (TV.8.9)) and thus obtain 下 According to Algorithm 4.3, O is implicitly 
determined by the condition Zn 十 Zn+l 三 0, and the numerical approximation is 
obtained from 


了 玫 1 三 WOrr 人 9) 守 E exp( 马 1) exp(O) 区 =exp(29) 芭 


The method obtained in this way is identical to Algorithm 2 of Zanna，Engyg 文 
Munthe-Kaas (2001). With the _ coefficients of the 2-stage Gauss method (Iable 
JI.1.1) and witha 王 1in(4.14) we thus get 





3 工 3 
0; = (而 il 4])， 2 = 有 
芭 H1 = exp(29) 芭 = = emp(3 (4 二 42) 一 了 2([D， 41] + [22， 42]) ) 3 
where 4; = 4(exp(Ci) exp(6) 环 ).This is asymmetric Lie group method of order 


four. We can reduce the number of commnutators by replacing f2; in the right-hand 
expression with its dominating term. This yields 





3 到 2 丸 2 
(0 三 -4 | 区 [4 49]， f22 三 4 一 过 [4i, 4a] 
(4.19) 


人 
we 所 [4 4) 丈 


(cf. Exercise IV.19). Although we have neglected terms of size O(j), the method 
remains of order four, because the order of symmetric methods ls always even. 
For any linear invertible transformation p, the parametrization (4.13) Satisfies 


pwWr(O) =pexp(O)U = exp(OOQp )pU = Wor(cD) 


with zc = pp-1. Hence (4.9) holds true. Ifthe problem (4.11) is p-reversible, ie,， 
04(Y) = 一 4(OY)p, then the truncated differential equation (4.14) is a-Teversible 
for all choices of the truncation Index qg. Moreover after the simpliftications that lead 
to method (4.13), the DO-compatibility condition (1.4) is also Satisfied. 
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The following variant is also proposed in Zanna, Enggy 广 Munthe-Kaas (2001). 
Instead of computing O from the relation 2 十 Zr+1 = 0, Oils determined by 


I the coefficients Satisfy es+1 三 一 ei, this modification gives Symmetric Lie 
group methods. 


V.S Energy - Momentum Methods and Discrete 
Gradients 


Conventional numerical methods, when applied to the ordinary differen- 
tial equations of motion of classical mechanics, conserve the total energy 
and angular momentum only to the order of the truncation error. Since 
these constants of motion play a central role in mechanics, it is a great 
advantage to be able to conserve them exactly. 

(R.A. LaBudde 多 D. Greenspan 1970) 


This section ls concerned with numerical integrators for the equations of motion 
of classical mechanics which conserve both the total energy and angular momen- 
tum. Their construction is related to the concept of discrete gradients. The meth- 
ods considered are Symmetric，which ls incidental but useful: in our View their 
good long-time behaviour is a consequence of their Symmetry (and reversibility) 
more than of their exact conservation properties; See the disappointing behaviour of 
the non-Symmetric energy- and momentum-conserving projection method in Exam- 
ple ITV .4.4. 


A Modified Midpoint Rule. Consider first a single particle of mass 7 in 及 3， 
with position coordinates dg( < 及 3, moving in a central force field with potential 
CU(q) = 六 (cl e.g， V(r) = -1 in the Kepler problem). With the momenta 
六 三 7md(b,the equations of motion read 


1 9 
= 一 D，D=-VT = 一 广 (cl 一 
(g) (cl) [四 


Constants of motion are the total energy 妃 = 了 T(p) 十 V(q)，with T(p) = 
2ll27/(2r), and the angular momentum 二 一 d xD: 


d : 1 1 
二 (gxD 人 = 和 xDp+qxDD= 二 Dxp 一 (el 一 xdq=0. 
老 7 al 


We know from Sect.IV.2 that the implicit midpoint rule conserves the quadratic 
invariant 了 一 0X2,andTheoremIV.2.4(orasimple direct calculation) shows that 
了 remains actually conserved by any modification of the form 
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几 1 
太 二 下 6 天井 一 元 PPmp 十 1/2 于 (Dn 十 Dn+i) 
Jo+l 二 加 天 Pr with “ G.D 


2Dn+1 三 2 一 AAAVU(qn+l/2) dmn+1/2 一 二 (an 未 dn+1) 








where Ai an arbitrary real number Simo，Tarnow 女 Wong (1992) introduce 
this additional parameter A and determine it So that the total energy 1S conserved: 
五 (pn+ldqn+l) = 五 (pn dan). With the notation Fn+l/2 = -VU(d+l2) 三 
一 友人 eayalDXlen+ayal . gn+1/2 We have 


上 及 
T(pn+1) 三 了 (pn 十 KPI1/2) 一 了 (nn) 十 0 了 n+1/2 ， 





and hence the condition for conservation of the total energy 妃 = 全 十 L becomes 


岂 
上 元 Pp+1/2 Fr+l/2 一 Von) 一 Von+l) 


This gives a reasonable method even 下 2 +112 n+l/2 直 arbitrarily close to Zero. 
This is seen as follows: let a = 一 & 太 (oayal)Aaraiyall so that Fn+Hl/2 二 
adn+1/2. The above condition for energy conservation then Teads 


玉 
G 元 PrAl/2 qn+1/2 三 冯 (ob 一 站 (lw ， 


Where we note further that 


于 天 本 所 dnil/2 = (qh 一 an 二 (nl 十 gm) 
王立 ( 人 losei 习 一 | 三 (easall 一 eolD) 斌 (lensall+llenl) 
These formulas give 


Yo 二 dllelp 1 (5.2) 


ons 一 人 en 寺 lleosall 二 lenl) 





with which method (5.1) becomes 


用 
qdn+1l 三 9n 十 万 Pr+1/2 
(3.3) 





了 ons 一 立 人 oo dn+172 
| 一 人 eol 过 losal+llos) 





Dmn+l 二 Dm 


This ls a Second-order Symmetric method which conserves the total energy and 


the angular momentum. It evaluates only the potential V(9) = Y(|cl. The force 
--VV(g) = 一 友人 cl) 1 is approximated by finite differences， 


The energy- and momentum-conserving method (5.3) first appeared in LaBudde 
人 Greenspan (1974). The method (S$.1) or ($.3) ls the starting point for extenslions 
in Several directions to other problems of mechanics and other methods; See Simo， 
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Tarnow 女 Wong (1992), Simo 多 TIarnow (1992), Lewis & Simo (1994, 1996), Gon- 
zalez 儿 Simo (19960)，Gonzalez (1996), and Reich (19596b). In the following we 
consider a direct generalization to Systems of particles, also given in LaBudde 祥 
Greenspan (1974). 


An Energy-Momentum Method for N-Body Systems. We consider a system of 
AN particles interacting pairwise with potential forces which depend on the distances 
between the particles. As in Example IV.1.3, this ls formulated as a Hamiltonian 
System with total energy 


1 一 1 


五 (p, 9) 三 一 3 人 rp+ > (le- ol (3.4) 


2 


As an extenslion of method (3.3), we consider the following method (where we now 
write the time Step number as a Superscript for notational convenience) 


侯 十 1 也 久 7 十 172 











由 人 
和 (3.3) 
时 十 1/2 二 1/2 
关怀 
J=1 
Where PPT /将 于 (D2 十 PP )， 0 一 直 (@2 十 gt+T), and fori > 沁 
0 (5.6) 
她 二 人 二 I I * 
全 一 7 各 于 (7 人 填 人 ) 
with r 攻 三 | 他 一 四 and oz 三 0.This method has the following properties. 


Theorem S.1 (LaBudde & Greenspan 1974). 71e 7aetpod (5.3) mi 如 (3.6) 1 0 
SeCO11d-oO1der SUNeB7EC 111DLicit 1etjod WAPicp COPSermes 1 1je lolaL 710eQ7 110O111e11- 
12111 书 一 二 此 1Di， tpe olal a18LLa7 11O11e111111 了 三 浊 尖 19i X Di G1d 1 加 e 1o1al 
e11e18y 万 . 


Proojf A comparison of (5.6) with the edquations of motion Shows that the method 
is of order 2. Similar to the continuous case (Example IV.1.3), the conservation 
of linear and angular momentum is obtained as a consequence of the Symmetry 
0ij 一 ojii forall ?7. Forthe linear momentum we have 


人 2 四 人 9) - 交 


上 


For the proof of the conservation of the angular momentum we observe that the 


first equation of ($.5) together with PP+12 一 二 (D2+ 1 十 28) yields 


(一 gx (of 十 2) 一 0 (5.7) 
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for all ?. The second equation of (5.3) together with qd9+1/2 一 (qd; 9 十 qd ) gives 
和 
>》 (他 十 用 ) x (故人 一 下 ) = 0， (5.8) 
i 一 1 

because ai = ojii and therefore 到 -105 gd2+1/2 x 1 一 0.Adding the sum 

over 1 of (3.7) to the equation (3.8) proves the 人 守 记 汉 下 久 2 二 1 


于 gf X DY 
It remains to Show the energy conservation. Now, the kinetic energy 7Z(D) = 三 


N | 丘 
5 > -17 2D7 Di at step 你 十 工 1S 


TV 


T(p2+1) 径 T(zo) 区 Ts DLL Jr wm Ce 王 人 


和 IJ 一 1 


We 
= TOn)+》 > oa (一 人 (0 
i1 7=1 





Using once more the symmetry coi7 = oji, the double sum reduces to 


N N z1 
3 >》， 0 订 《Ca dg) ( 力 ) 了 ((o 人 一 0 市 (@ 这 J)) 
1 一 1 7 一 1 
1 一 1 本 
二 汉 流 3 6 了 (3 人 ) 


未 





On the other hand, the change in the potential energy is 


WN 1 一 1 


Ti- = > (St 一 合 ( 雹 ))， 


人 





and hence (3.0) yields the conservation of the total energy 互 = 不 十 已. 











Discrete-Gradient Methods. The methods ($.3) and ($.5) are of the form 


Zn+l 二 yn 宙 几 古 (1 2 ) 立 末 (yn 2 ) (35.9) 


where 万 (办 攻 is a skew-symmetric matrix for all 亿 V and V 万 (办 轨 is adiscrete 
8S1adient of , that is, a continuous function of ( 砂 y) satisfying 


VY 万 轧 人 作 一 切 一 玖 玉 一 鼠 () 


(5.10) 
V 万 (yy) = V 万 (y) 


The symmetry of the methods is seen from the properties 万 (办 人 急 = 互 ( 念 and 
YV 万 (办 人 = YV 万 (办 . For example, for method ($.3) we have, with y = (D,9g) 
and 允 二 (2 9， 
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万 /全 0 一 4 元 厅 / 信 天 (全 十 刀 ) 
万 (四 人 切 = and  V 万 (办 2 三 有 
上 0 ) we 人 去 (2+9) 


Where al(g dj is given by the expression (5.2) with (0 q) in place of (an+i,dn) OF 
by the corresponding limit as dl| 一 jcl. 
The discrete-gradient method (3.9) ls consistent with the differential equation 


= BU) YU) (G.1D) 

with the skew-symmetric matrix 巨 (y) = 万 (yy 四) .This system conserves 万 , Since 
d 

丰 


and, as was noted by Gonzalez (1996) and McLachlan, Quispel 和 Robidoux (1999)， 
刀 js also conserved by method (3.9). 


万 (人 =Y 万 (人 攻 =Y 感 (7 BY =0， 


Theorem S.2. 77Pe discrete-81adient 111ethod (5.9) co1lyervesy 1ne ivariant 瑟 oftpe 
SySte112 (3.77). 


Proof The deftnitions (3.10) of a discrete gradient and of the method (3.9) give 


五 (yn+1) 室 五 (yn) 二 立 吾 (yn+l， or) (nl 二 om) 
一 尺 V 万 (yn+l， Zr) 已 (n+ om)V 巨 (noyn)=0， 











Where the last equality follows from the skew-symmetry of 巨 (ym ri; on)， 





卫 xample S$.3， The Lotka-Volterra System (I.1.1) can be written as 


多 0 一 U 
国史 0 ) va 
with the inyariant (wv) = mv 一 必 十 2Invuv 一 vof (.1.4). Possible choices of a 


discrete gradient are the coo1dzjzate ice1tet discrete 81adient (Itoh 久 Abe 1988) 


万 ( 动 v) 一 瓦 (uv) 








雹 厅 ( 信 全. 全 一 亿 
V 万 (了 人 方 岂 V) 三 天 依从 二 所 操作 (S.12) 
分 一 


and the 11idpoipat discrete 81adie1t (Gonzalez 1990) 


玉 失 一 及 轨 -YEGJTAy 


V 万 世人 = YY 万 力 1 [ET 





(5.13) 
with 到 = 寺 ( 分 十 9) and Ay = 分 一 人 卫 contrast to (5$.12), the discrete gradient 
(3.13) yields a Symmetric discretization . 


A systematic Study of discrete-gradient methods is given in Gonzalez (1990) 
and McLachlan, Quispel & Robidoux (1999). 
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V.0 上 Xercises 


1. 


光 


.Let On 三 o 电 oO，Where 0 


Prove that (after a Suitable permutation of the Stages) the condition cs+1_; 三 
1T 一 ci (人 (forally?)is alsonecessary for a collocation method to be Symmetric. 
Prove that explicit Runge-Kutta methods cannot be Symmetric. 

丽人 入 a one-step method applied to y = 》y yields Wi = 尽 (PA)yo then, a 
necesSary condition for the symmetry of the method is 民 (2) 尽 (-2) = 1 forall 
complex >. 


. Consider an irreducible diagonally implicit Runge-Kutta method (irreducible 


in the Sense of Sect. VI.7.3). Prove that the condition (2.4) 1S necesSsary for the 
Symmetry of the method. No permutation of the stages has to be performed. 

加 加 represents the exact fow ofy = JI 四 (四 ). In the 
Situation of Theorem IIU.3.17 prove that the local error (3.4) of the composition 


method (3.3) has the form 
1 1 
恨 ( 序 (6a- DIDz[Dz, Dj+ 瑟 4-6a+6a[Du[Du Dal)raly)， 


where, as usual, Dig(y) = 9 (7 加 (四 . The value a = 0.1932 is found by 
minimizing the expression (6aw -- 1)2 十 4(1 一 6a 十 6a2)2 (McLachlan 1995). 


. Forthe linear transformation po(D, 9) = (一 D, q),consider a p-Teversible problem 


(1.3) with scalar p and 0. Prove that every solution which crosses the q-axils 
twice is periodic. 


. Prove that 过 a numerical method conserves quadratic invariants (IV.2.1), then 


So does its adjoint. 


. For the numerical solution of y = 4( 信 V consider the method yn 一 yn+l 


defined by yn+1 = 2( 志 十 妃 , where z( 区 is the solution of 
= 4 的 z， xz() =g， 


and 4 人 is the interpolation polynomial based on Symmetric nodes cl, ... ,cs， 
le., Cs+1_i 十 ci 一 1forall;. 

a) Prove that this method is Symmetric. 

b) Show that yn+1l = exp(V2( 由 )Vyn holds, where (2(P) has an expansion in odd 
powers of 几 . This justifties the omission of the terms involving triple integrals 
in Example IV.7.4. 


. 了 开 2n stands for the implicit midpoint rule, whpat are the Runge-Kutta coe 侍 - 


cients of the composition method (3.8)2 The general theory of Sect.IIU.1 gives 
three order conditions for order 4 (those for the trees of order 2 and 4 are auto- 
matically Satisfied by the Symmetry of the method). Are they compatible with 
the two conditions of Example 3.37 


. Make a _ numerical comparison of our favourite composition methods 26 s9， 


2D8 sl17, and Dp10 s35 for the Lorenz problem 


内 三 一 (一 纺 ) Mi(0) = 10 0=10 
三 一 Was 二 7 一 (0) = 一 20 7 一 28 (0.1) 


0 三 yo 一 00s ma(0)=20 =8/3 
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Fig. 6.1. Comparison of various composition methods applied to the Lorenz equations 


10. 


11. 


with exact Solution 


Ji(1) = 8.635692709892506017930544628639 
(1) = 2.798663387927457052023080059065 (0.2) 
ya(1) = 33.36063508973142157789185846267 


by composing for0 乞 寺 冬 1 the Second order sy1z111etric SPUL8 Scheme (See 
Kahan 儿 Li 1997) which, for the time-stepping Wi mm ,is given by 


几 
这 二 住 三 有 十 妆 一 22 一 玖 )) 





刀 

和 妇 一 多 二 了 ( V1Y3 一 好 Vs 十 7 十 7 辽 一 思 一 芭 ) (6.3) 
刀 

玖 一 同一 可 ( 几 玉 十 区 思 一 bs 一 53). 


This method requires, for each step, the solution of a /inear System only. The 
results are Shown in Fig.0.1. 

9y111111et1jized order coaditio11y (SuZzuki 1992)，Prove that for methods (3.8) of 
order four with 7i Satisfying (3.10) 


5 大 2 S 太 1 
> 2 =-0 这 > 说 (站 =0 
K=1 L=1 K=1 L=1 《一 K 

The prime after (before) a sum Sign indicates that the term with highest (low- 
est) index is divided by 2. Prove also that the order conditions given in Suzuki 
(1992) for orderDp 和 8 are equivalent to those of Example 3.3. Is this also true 
for order2 = 107 

ZL Use relations like 到 一 1 荆 有 2 . 


Let AM = {(y2;0)| 姑 十 邮 十 好 = 1 ,and consider for o e AM the 
tangent Space parametrization 


Wal(z) 一 Q 十 2 十 aua(2)， 
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Where, for z E 思 5A, the real value wa(z) is determined by the requirement 
ua(2) E 人 4. Prove that Algorithm IV.5.3, with the trapezoidal rule in the role 
of oh, is asymmetric method. 

AL Since 2is alinear combination of ao and Wo(zj,itis uniquely determined 
by az z (which is zero) and Wo( 2z) 工 >. 

(Zanna，Engy 贸 Munthe-Kaas 2001). Verify numerically that the Lie group 
method (4.12) based on the implicit midpoint rule does not conserve general 
quadratic first integrals. One can consider the riglid body equations in the form 
(TV.1.5). 


Chapter VI. 
Symplectic Integration of Hamiltonian 
Systems 





Fig. 0.1. Sir William Rowan Hamilton, born: 4 August 1805 in Dublin, died: 2 September 
1865. Famous for research in optics, mechanics, and for the invention of quaternions 


也 amiltonian Systemas form the most important class of ordinary differential equa- 
tions in the context of “Geometric Numerical Integration . An outstanding property 
of these Systems is the Symplecticity of the fow. As indicated in the following dia- 
gram， 

Ordinary Differential Equations 


of motion canonical 
(Lagrange) 人 ( Hamilton) 


2 


1St order Partial DE 
把 -> GOenerating Functions 
(Hamilton-Jacobi) 


Variational Principles 
(Lagrange, Hamilton) 


Hamiltonian theory operates in three different domains (equations of motion, partial 
differential equations and variational principles) which are all interconnected. Each 
of these viewpoints, which we will study one after the other, leads to the construction 
of methods preserving the Symplecticity. 
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VI.1 Hamiltonian Systems 


了 amilton's equations appeared first, among thousands of other formulas, and in- 
Spired by previous research in optics, in Hamilton (1834). Their importance Was im- 
mediately recognized by Jacobli, who stressed and extended the fundamental ideas， 
So that, a couple of years later, all the long history of research of Galilei, Newton， 
Euler and Lagrange，was, in the words of Jacopi (1842), “to be conslidered as an 
introduction ”. The next mile-stones in the exposition of the theory were the monu- 
mental three volumes of PoincarE (1892,1893,1899) on celestial mechanics, Siegel's 
“Lectures on Celestial MechanicsS”(1956)，English enlarged edition by Siegel 公 
Moser (1971), and the influential book of V.I. Arnold (1989; first Russlian edition 
1974). Beyond that, Hamiltonian Systems became fundamental in many branches of 
physics. One Such area, the dynamics of particle accelerators, actually motivated the 
construction of the first symplectic integrators (Ruth 1983). 


VI.1.1 Lagrange”s 上 quations 


Equations diff&rentielles pour la solution de tous les problsames de Dy- 
namique. (J.- 革 . Lagrange 1788) 


The problem of computing the dynamics 
of general mechanical Systems began with 
Galilei (published 1638) and Newton's Prz7- 
ciPpiQ (1687). The latter allowed one to reduce 
the movement of free mass points (the “mass 
points”being Such planets as Mars or Jupiter) 
to the solution of differential equations (See 
Sect.[I2). But the movement of more com- 
plicated Systems Such as riglid bodies or bod- 
ies attached to each other by rods or Springs， 
were the Subject of long and difficult devel- 
opments, until Lagrange (1760, 1788) found 
an elegant way of treating Such problems in 
general. 

We Suppose that the position of a mechan- 
ical System with c degrees of freedom is de- 
scribed by dg = (qi ,qd)7 as gemeralized 
coo1di1iates (this can be for example Cartesian coordinates, angles, arc lengths along 
a curve, etc.). The theory is then built upon two pillars, namely an expresSion 


了 一 了 (9 9) 41.1D) 


which represents the Lietic energy (and which is often of the form 307 AM(q)d 
where M(a) is symmetric and positive definite), and by a function 





Joseph-Louis Lagrangel 


1 Joseph-Louis Lagrange, born: 25 January 1736 in Turin, Sardinia-Piedmont Cnow Italy)， 
died: 10 April 1813 in Paris. 
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芝 =C(9) (1.2) 


Tepresenting the pote1tial eergy. Then, after denoting by 


五 = 人 一 这 (1.3) 
the corresponding Zagragiam, the coordinates dl ( 轨 ,，.,dd(t obey the differential 
equations 

qd /07 D7 
人 (1.4) 
qt \ oO oOd 


Which constitute the ZLagra1lge eqguations of the System. A numerical (or analytical) 
integration of these equations allows one to predict the motion of any Such System 
from given initial values (Ce Sont ces Equations qui SerViront a deEterminer la courbe 
deEcrite par le corps AM et Sa vitesse a chaque instantf ; Lagrange 1760, p. 309). 


Example 1.1. For a mass point of mass 7n in 及 3 with Cartesian coordinates Z 一 
(Zl;zZay7Z3)7 wehaveT(z) 一 mm( 字 十 让 十 2)/2. We suppose the point to move 
in a conservative force field 忆 (Z) = --VUV(z). Then, the Lagrange equations (1.4) 
become 7 = 下 (Z)， which is Newton's second law. The equations (I.2.2) for the 
planetary motion are precisely of this form. 


上 xample 1.2 (Pendulum). For the mathematical pendulum of Sect.I.1 we take the 
angle a as coordinate. The kinetic and potential energies are given by 人 丰 = 7m(2Z2 十 
22)/2 = ml2G2/2 andU = mgy = 一 mglcos arespectively, so that the Lagrange 
equations become 一 mglsin a 一 7np2? 人 = 0 or equivalently 人 十 Ysin a = 0. 


VI.1.2 Hamilton”s Canonical Equations 


An diese Zamziltozscpe Form der Differentialgleichungen werden die 
ferneren Untersuchungen，welche den Kern dieser Vorlesung bilden， 
ankntpfen; das Bisherige ist als Einleitung dazu anzusehen. 

(C.G.J. Jacobi 1842, p. 143) 


了 Hamilton (1834) Simplified the structure of Lagrange's equations and turned them 
into a form that has remarkable symmetry, by 


e introducing Poisson's variables, the conjugate 12o11e111C 


元 
天 = 人 or 
Or 


e@ considering the 玉 Cz1zzl1o7zza17 
囊 汪 一 09 (1.6) 


as a function of p and q, ie., taking = 万 (p,q) obtained by expressing gd as a 
function ofp and dg via (1.3). 


Here it is, of course, required that (1.3) defines, for every 9q, a continuously differ- 
entiable bijection d 一 D. This map is called the Zege1adye 1ra11s1o7710. 
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Theorem 1.3， Zagramges eguatio11y (7.4) G1e eguiyalent io 已 a11a1ilio11 sy eqgUa1io18 
O 万 0 万 
及 三 一方 一 (D, 9)， 驮 三 去 一 (D,9)， 太一 1，d. (1.7) 
5 丙 (29) 


Proof The deftinitions (1.5) and (1.6) forthe momentap and for the Hamiltonian 妃 
imply that 





0 
页 “7? 男 而 和 

8 rs 8 sr 上 
or a0 6 6 6r 











The Lagrange equations (1.4) are therefore equlivalent to (1.7). 





Case of Quadratic 人 . mn the case that 全 = 97M(9q)dis quadratic, where M(9) 
is a symmetric and positive definite matrix, we have, for a fixed q,Dp = M(dq)d, so 
that the existence of the Legendre transform is established. Further, by replacing the 
variable d by M(q)-ipin the definition (1.6) of 瑟 (p, ,we obtain 


刀 29g) = D MO 2 一 LOMOD 
= PTMOO -2 5-ipTUG) = 了 DMIip+TUO 


and the Hamiltonian ls 妃 = 人 十 L which is the iolal energy of the mechanical 
System . 


In Chap.I we have Seen Several examples of Hamiltonian Systems, e.g., the pen- 
dulum (I.1.13), the Kepler problem (1.2.2), the outer Solar System (1.2.12), etc. In the 
following we consider Hamiltonian systems (1.7) where the Hamiltonian 妃 (P, 9) is 
arbitrary, and so not necesSsarily related to a mechanical problem. 


VI.2 Symplectic Transformations 


The name “complex group”formerly advocated by me in allusion to line 
complexes, ... has become more and more embarrassing through colli- 
Sion with the word“complex”in the connotation of complex number. 工 
therefore propose to replace it by the Greek adjective “symplectic” 

(H. Weyl (1939), p. 105) 


Atfirst property of Hamiltonian Systems, already seen in Example 1.2 of Sect. TV.1， 
is that the Hamiltonian 妃 (p, 9) is airst integral of the System (1.7). In this section 
we Shall study another Important property - the sy1iPjecticity of its fow. The basic 
objects to be studied are two-dimensional parallelograms lying in 有限 24. We Suppose 
the parallelogram to be Spanned by two vectors 


人 


in the (Pp, q) space (cP, 9, 71P, 1 are in 月 四 as 
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已 ={ 攻 +s110<t<10<s<1 
In the casew 三 1 weconsider the oriemied area 
6 
or.area ( 忆 ) = det 人 E2114 一 上 272 (2.1) 
(See left picture of Fig.2.1). In higher dimensions, we replace this by the xz of je 
ofie1tted ad1eas of tpe projections of Ponto ipe coordinate planesy (Di qi), ie., by 


QI z 也 地 
全 一 dt( 生 全) -We 
1 二 1 三 王 


i 一 1 
This defines a bilinear map acting on vectors of 及 24, which will play a central role 
for Hamiltonian Systems. In matrix notation, this map has the form 


wl 纪 人)= 上 7J1 with 了 J= 公 | (2.3) 


where y is the identity matrix of dimension w. 

Definition 2.1. A linear mapping 4 : 月 24 一 月 24 is called symplectic 证 
47.J4=7 

or equivalently, 这 w(4e, 47) = w( 和 四) forall cm eE 及 24. 


9 4 
47 
7 4 
一 一 2 
6 全 
D D 


Fig. 2.1. Symplecticity (area preservation) of a linear mapping 


In the case dg = 1, where the expression w(E,7) Tepresents the area of the paral- 
lelogram 尸 , symplecticity of a linear mapping 4 is therefore the area Preservafiom 
of 4 (see Fig.2.1). In the general case (qd > 1), Symplecticity means that the sum 
of the oriented areas of the projections of 已 onto (pi di) is the same as that for the 
transformed parallelograms 4( 己 ). 

We now turn our attention to nonlinear mappings. Differentiable functions can 
be locally approximated by linear mappings. This justifies the following deftinition. 


Definition 2.2.A differentiable map g : U 一 月 24 (where DC 了 24 is an open se 
is called symzplectic 过 the Jacobian matrix 9g'(P, q) is everywhere Symplectic, i.e., 让 
9 909 or oggD9059 (007) 三 ww 


Let us give a geometric interpretation of Symplecticity for nonlinear mappings. 
Consider a 2-dimensional Sub-manifold M of the 2ad-dimensional set V, and Sup- 
pose that it is given as the image M = W( 开 ) of a compact set 及 C 月 2，where 
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(sb is acontinuously differentiable function. The manifold M can then be con- 
Sidered as the limit of a union of Small parallelograms Spanned by the Vectors 


DU DU 
Ds 0 
For one Such parallelogram we consider (as above) the Sum over the oriented areas 


of its projections onto the (pi, %) plane. We then sum over all parallelograms of the 
manifold. In the limit this gives the expression 


加 DW DV 
二 Je 全 en 2 ds qt. (2.4) 


The transformation formula for double integrals implies that 2(AM) is independent 
of the parametrization 几 of AM. 


(s, 划 ads and 一 (5 让 dt. 


Lemma 2.3. 矿 1pe maappigg :Dr 一 月 2418symaplectic on 1 加 en 这 preservey 1Pe 
expD1essio1 12(AMI) Le， 

PC)) = 2OM) 
HPolds Jor al 2-di11ze11sio11G1 MG111o1ds AM 1pat ca1 pe 1ep1ese1ited dy 1je 10age oa 
CO11I11UOUSLY Gere1tiaple jazzctiozz 


Poof The manifold q( MT) can be parametrized by go W. We have 


Ji = 人 (2 (go 攻 GD 2 和 bwu= 200 


because (goW)/(s 昌 一 gf (Vs， 攻 (s,andgis asymplectic transformation. 


For qd = 1, M is already a subset of 及 2 and we choose 玉 = M with 炒 the 
identity map. In this case, (AM) = 岂 jz ds dl Tepresents the area of JMM. Hence， 
Lemma 2.3 states that all symplectic mappings (also nonlinear ones) are Gea P1e- 
SeE1V1118. 

We are now able to prove the main result of this section. We use the notation 
/= (D,9qj, and we write the Hamiltonian system (1.7) in the form 


六 = JJ V 厅 (J)， (2.5) 


where .JJ is the matrix of (2.3) and V 瓦 (W) = 万 (人 二 

Recall that the flow wy : U 一 了 24 of a Hamiltonian system is the mapping that 
advances the solution by time 二 Le., px(Do;,9o) = (D( 坟 Do,qo),d( 坟 Do,qo))， where 
2D( 坟 Do,qo), d(tDo,qo) is the solution of the System corresponding to initial values 
2(0) = po, 9(0) = 00 
Theorem 2.4 (Poincar6 1899). Zef (P,q) pe a twice cotiuotsly dijerentiaple 
Jonaction on U C 陈 24. 7Tjem jpreachp jixed 坊 1pejiomopriasynpiectic transjopraa- 
1io11 Wjeeyver iis de1ojzed. 

















Proof The derivative Opi/byo (with yo = (po,qo)) ia solution of the Vari- 
ational equation which，for the Hamiltonian System (2.3)， is of the form 忌 三 
JIV2 厅 (pi(yo)) 歼 , where V2? 厂 (p,d) is the Hessian matrix of 五 (p,q) (V2 厅 (pg) 
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Fig. 2.2. Area preservation of the flow of Hamiltonian Systemas 


ls Symmetric), We therefore obtain 
Ca 


=- (号 ) Yeto)777(2)+ (号 ) zeta 和 = 








Oyo Oywo Ovo Ovo 
because .JIL = -Jand .JTJ = -- 太 Sincetherelation 
Or 下 Op; 
一 JJ/ 一 三/ 2.0 
yo 】 Oyo 】 0 


is Satisfied fort 上 = 0 (po is the identity map), it is satisfied for all t and all (2o, ao)， 
as long as the solution remains in the domain of definition of 豆 . 














了 xample 2.S， We illustrate this theorem with the pendulum problem (Example 1.2) 
Using the normalization 7 二 《上 三 0 三 1.Wehaved=oa2D= Gandthe Hamil- 
tonian ls given by 
厂 (p,9) =/2 一 cos9. 

Fig.2.2 Shows level curves of this function, and it also illustrates the area Preser- 
vation of the fow pt. Indeed, by Theorem 2.4 and Lemma 2.3, the areas of 4 and 
o2x( 4) as well as those of 已 and wx( 刀 ) are the same, although their appearance is 
completely different. 


We next Show that Symplecticity of the fow is a characteristic property for 
Hamiltonian systems. We call a differential equation y = jy) 1ocal1y 瑟 azrazltomia1， 
让 for every yo E TU there exists a neighbourhood where jy) = .JIV 末 (JW for 
Some function 万 . 


Theorem 2.6. Letf 矿 : U 一 陈 24 pe comtipnzors1y djerentiaple. 7T1em 0 = 0) 8 
1ocally PEammailtonian 矿 azad oly ifi Jomw pr msSy1aplecticjoraly ET andJor 
GLL SU 太 cie1atiy SG1 志 
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Prooj The necessity follows from Theorem 2.4. We therefore assume that the flow 
or is Symplectic, and we have to prove the local existence of a function (y) such 
that /y) = .JIV 万 ( 力 . Differentiating (2.6) and using the fact that 90py/6wo isa 
solution of the variational equation 让 一 户 (wx(yo)) 亚 , we obtain 


8(( 品 ) 7( 王 )) = ( 吕 ](7leeojrzrorlelo))( 张 ) -0 


Ovo Oy0 Oyo 





Putting 一 0, it follows from .J = 一 .7 that JP(yo) is a symmetric matrix for 
all yo. The Integrability Lemma 2.7 below shows that .J.F(y) can be written as the 
gradient of a function 万 (y)， 














The following integrability condition for the existence of a potential was already 
known to Euler and Lagrange (See e.g.,，Eulers Opera O111110G， Vol. 19. p.2-3，oOF 
Lagrange (1760), p.373). 


Lemma 2.7 (Integrability Lemma). Zef 疡 C 了 ”pe open ad :万 一 玉 "”pe 
CO11DIUOUSLY Gerentiaple, aad assU111e 1Pat te Jacopian 户 () 1 Sy1a1aetric Jora1l 
VEDD.7Tjem jporevemy yo EDDtipereexrist a1ieigppoxrpood anad a jazzction 万 (W) 
SUCA 1Pat 


jg) =VU) (2.7) 
O1 妨 18 Jeigjppoxjood. 于 otjer wo1ds, tpe dizerential jorma 廊 (W) do 十 .十 
万 (W) dm = qd 万 BatotaldijerentiaL 


Prooj Assume yo = 0, and consider aball around yo which is contained in 刀 . On 
this ball we define 


1 
五 (y) = 区) 中 十 Comst. 


Differentiation with respect to Wk, and using the symmetry assumption 9 方 /6 三 
0 及 /bwi yields 


9 
DO 


g= (HE0Dw= 人 (0)w=AO 


开 














Which proves the statement. 


For 万 = 了 24 or for star-shaped regions 刀 , the above proof shows that the func- 
tion 如 of Lemma 2.7 is globally deftined. Hence the Hamiltonian of Theorem 2.0 
is also globally defined in this case. This remains valid for Simply connected Sets 
几 . A_ counter-example,， which shows that the existence of a global Hamiltonian in 
Theorem 2.0 is not true for general D, is given in Exercise 0. 

An important property of symplectic transformations, which goes back to Jacobli 
(1836，“Theorem X ), is that they preserve the Hamiltonian character of the differ- 
ential equation. Such transformations have been termed ca1o1ical Since the 19th 
century. The next theorem Shows that canonical and Symplectic transformations are 
the Same. 
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Theorem 2.8. Fet 力 : 了 一 太 peacpange ofcoordinaatey Sucp tatup azad 示 开 
G1e CO11DIUOMSUY Gere1atiaple jpactions. 1 1 Sy11Plectic, tpe 瑟 a11ailiomiamz syste110 


1 = JIV 万 (V) pecomrey 太 1pe memyvariaples zz 一 示 ( 攻 
汪 = JIVK(z) mw 碘 天 (>z) = 万 (人 力 . (2.8) 


Co1ive1se 太 矿 V 1ra1as1o71ay eyery 五 Ga11ailtomia1a sySte1a 10 Ga1101Per 本 Ga11azlio1zia1z 8yS- 
ie12 Via (2.8) 1 加 e1 mu 好 Sy11Djectic， 


Pioof Since 之 = 内 (7 and 录 (TVK( = YV 瑟 (JW, the Hamiltonian system 
0 = JIV 万 (y) becomes 


效力 7 Y 天 (3 C.9) 
in the new variables. It is edquivalent to (2.8) 下 
VODTWO = C.10) 


Multiplying this relation from the right by 妨 ( 轨 -7 and from the left by 录 ( 切 二 ! 
and then taking its inverse yields .] 三 信 ( 四 JW which shows that (2.10) is 
equivalent to the symplecticity of . 

For the inverse relation we note that (2.9) is Hamiltonian for all K(z) 让 and 
only 让 (2.10) holds. 














VI.3 First 上 Examples of Symplectic Integrators 


Since Symplecticity 1s a characteristic prop- 
erty of Hamiltonian Systems (Theorem 2.0)， 
it is natural to Search for numerical methods 
that Share this property. Pioneering work on 
Symplectic integration is due to de Vogelaere 
(1956), Ruth (1983), and Feng Kang (1985). 
Books on the now well-developed Subject are 
Sanz-9Serna 儿 Calvo (1994) and Leimkuhler 
久 Reich (2004). 


Deftinition 3.1. Anumerical one-step method 
jls called sy1azplectzc 寺 the one-step map 





全 三 2(yo) 


Feng Kang” 


ls Symplectic whenever the method is applied 
to a Smooth Hamiltonian System 


2 Feng Kang, born: 9 September 1920 in Nanjing (Chinaj, died: 17 August 1993 in Beijing; 
Picture obtained from Yuming Shi with the help of Yifa Tang. 
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Fig. 3.1. Area preservation of numerical methods for the pendulum; same initial sets as in 
Fig.2.2; first order methods (left column): 闵 = T/4; second order methods (right column): 
尹 王 T/3; dashed: exact flow 


了 xample 3.2， We consider the pendulum problem of Example 2.5 with the Same 
initial sets as in Fig.2.2. We apply Six different numerical methods to this problem: 
the explicit Euler method (I.1.5), the Symplectic Euler method (1L.1.9), and the im- 
plicit Euler method (1.1.6), as well as the Second order method of Runge (LI.1.3) 
(the right one)j, the Stormer-Verlet Scheme (11.17), and the implicit midpoint rule 
(1.7). For two sets of initial values (po, go) we compute Several steps with step Size 
几 王 T/4for the first order methods, and 几 = T/3 for the second order methods. 
One clearly observes in Fig.3.1 that the explicit Euler, the implicit Euler and the 
Second order explicit method of Runge are not Symplectic (not area preserVing). We 
Shall prove below that the other methods are Symplectic. A different proof of their 
Symplecticity (using generating functions) will be given in Sect. VI.5. 


In the following we Show the Symplecticity of various numerical methods from 
Chapters I and HI when they are applied to the Hamiltonian System in the vari- 


ables y = (D;, 9)， 
也 二 一 瓦 5 (D， d) 
4 三 五 p(P， d) 


Where 1 and 态 。 denote the column vectors of partial derivatives of the Hamil- 
tonian 瑟 (D, q) with respect to D and d, respectively. 


or equivalently 1 = :JIVT()， 
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Theorem 3.3 (de Vogelaere 195S6). 71e yo-calied sy1ap1ectic Eu1er ietjpodsy (1.9) 


Dip+L 二 DOmn 一 几 万 (Pn+1， dan) 2Dp+L 二 2Dpn 一 彤 万 (Pn， dn+1) G3 1) 
dmn+l1 三 qn 十 几 Ep(Ppn+l， dn) dmn+1 三 qn 十 HEp(Dn， dn+1) 














G1e Sy111Dlectic 11etjods of order |. 


Proojf We consider only the method to the left of (3.1). Differentiation with respect 
to (pn, qn) yields 


了 汪 几 万 六 0 D(Pn+l， dn+1) 全 了 一 彤 刀 ov 
一 app 工 Opn, an) 0 AH/ 
Where the matrices 刀 wo, 如 pp .. .ofpartial derivatives are all evaluated at (pn+1, 9n). 
This relation allows us to compute 2 


way the symplecticity condition (于 汪 全 和 | 2 一 了 


and to check in a straightforward 

















The methods (3.1) are implicit for general Hamiltonian Systems. For Separable 
瑟 (p,9q) = 了 (D) 十 V(q), however, both variants turn out to be explicit. It is inter- 
esting to mention that there are more general Situations where the Symplectic Euler 
methods are explicit. If, for a suitable ordering of the components， 
0 万 SR 
5 人 d) does not depend on py for7 过 沁 (3.2) 
你 
then the left method of (3.1J) is explicit, and the components of pn+1 can be com- 
puted one after the other. 于 , for a possibly different ordering of the components， 
0 万 


三 d) does not depend on gj for7 过 (3.3) 


then the right method of (3.1J) is explicit. As an example consider the Hamiltonian 


1 王 
刀 (prpeoym op) 一 了 人 十 7 2 一 rcosp 十 人 一 1 


which models a Spring pendulum in polar coordinates. For the ordering 2 < 7， 
condition (3.2) ls fulfilled, and for the inverse ordering 7 < w% condition (3.3). Con- 
Sequently, both Symplectic Euler methods are explicit for this problem. The methods 
Temain explicit if the conditions (3.2) and (3.3) hold for blocks of components in- 
stead of single components. 

We consider next the extension of the Stormer-Verlet Scheme (I.1.17), consid- 
ered in Table I 工 .2.1. 


Theorem 3.4. 77Pe S1Ormaze/ 一 Verlet scpe11aey (1.7.77) 


玫 
PPn+l/2 二 2 一 本 (pn+l/2,gn) 


尺 
dmn 十 了 ( 瑟 (pnray2 gn) 十 配 (paaj2gn) (3.4) 


dm 十 1 





尺 
Dmnp+l 二 Dn+1/2 一 了 To(Pn+1/2，gn+1) 
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CQG11G 
及 
dmn+1/2 二 dmn 十 林 1o(pw n+l/2) 


几 
Dmnp+l 二 Dmn 一 了 ( 囊 (@mw， dn+1/2) 十 五 p(Pn+l1， mw4a/a)】 (3.3) 





必 
dn+1 三 qn+1/2 十 了 妃 g(Pn+l， dm 二 1/2) 
G1e Sy111Dlectic 11etjods of order 2. 
Prooj This is an immediate consequence of the fact that the Stormer-VYerlet Scheme 


is the composition of the two Symplectic Euler methods (3.1). Order 2 follows from 
its Symmetry. 














We note that the Stormer-Verlet methods (3.4) and (3.5) are explicit for Separa- 
ble problems and for Hamiltonians that satisfy both conditions (3.2) and (3.3). 


Theorem 3.S，77Pe iiPLicit aidpoimt Fle 
or 三 名 十 PTIV((nsi 二 如)/2) (3.6) 
18 QSy11Dlectic 11etpod of order 2. 


Prooj _ Differentiation of (3.6) yields 


(= 人 (光志 ) = (7+ 二 7-1V? 甩 )， 


Again it is straightforward to verify that ( 了 人 人] 一 ./. Due to its Sym- 
metry, the midpoint rule ls known to be of order 2 (See Theorem II.3.2). 














The next two theorems are a consequence of the fact that the composition of 
Symplectic transformations is again Symplectic. They are also used to prove the 
existence of Symplectic methods of arbitrarily high order and to explain why the 
theory of composition methods of Chapters II and III is So important for geometric 
integration,. 


Theorem 3.6. ZLel Oh de1ote 加 e 8Sy11Plectic EU1er 11etpod (3.1) 7T1e1 1je co11iDo- 
Si1io1 11e1pod (11.4.6) 18 8Sy111Dlectic Jor every cpoice or1je paramtetersy ai 太 . 

矿 2 18 Sy111DUectic G12d Sy111111e1ric (e.8.，1e 10PLicil 1IidpoIat FUe or 加 e 
1O711e1 一 Ye1ietf scpe11e) te1l 1je co11Dositio1l 11etpod (了 了 8) 18 Sy1D1ectic 1o0. 














Theorem 3.7. 4ssz1re 1Pat pe Earailionian give py 五 (V) = 万 (yy) 十 五 2(y)， 
G1C co11sider 1Pe SPDLLLI18 


= JIVT 厅 (人 = .YY 太 (JW 上 +.JIVPE2()， 











7Tje SPUi11i18 1Ietpod (11.3.0) 158 加 e1 Sy1IDUectic. 
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VI.4 Symplectic Runge- 下 utta Methods 


The systematic study of Symplectic Runge-Kutta methods started around 1988, and 
acomplete characterization has been found independently by Lasagni (1988) (using 
the approach of generating functions), and by Sanz-Serna (1988) and Suris (1988) 
(using the ideas of the classical papers of Burrage 儿 Butcher (1979) and CrouzeiX 
(1979) on algebraic stability). 


VI.4.1 Criterion of Symplecticity 


We follow the approach ofBochev 儿 Scovel (1994), which is based on the following 
important lemma. 


Lemma 4.1.， Fopr Rz1zge-Kxtta 171etpods da12d Jor partifioned Rz11ge-KUtta 1711etpods 
1je Jo1iowizg diagra11z coO1I111U1eS- 


= VO0= 一 


| menod | menod 


{2n} ES {yn 


(Porizo1tal a17ows 1ea11 0 di1erentiation with 7espectio yo) 7T1erejore, 1Pe 1U11e 广 
icaQL 1esuULt yn opiaied Jromz apPp1ying 加 e 11etpod lo 1pe proplez aU811e1lted Dy 
is Variatioal egUatiom 1 egual 1o 加 e 11U1aierical solution Jory = UV) au8g7ae1tted 
By is derivalive = yn /600. 


Prooj The result is proved by implicit differentiation. Let us illustrate this for the 
explicit Euler method 
yn+1l 三 gr 十 几 Fon)， 

We consider yn and yn +l as functions of yo, and we differentiate with respect to V0 
the equation defining the numerical method. For the Euler method this gives 

Own+1l On On 

-一 十 Am) 一， 

ay 6w aywo 

which is exactly the relation that we get from applying the method to the variational 
equation. Since 9yo/6ywo = 了 wehave 9n/8yo = 办 forall7， 














The main observation now ls that the Symplecticity condition (2.0) is a quadratic 
first integral of the variational equation: we write the Hamiltonian System together 
with its variational equation as 


六 = .三 LV 厅 (W， 下 = .三 1V2 厅 (yy) 亚 . (4.1) 


It follows from 
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(JIV2 厅 (人 到 )7 7 严 十 歼 5 JCYV2 厅 ( 罗 ) =0 


(See also the proof of Theorem 2.4) that V7 JJV is a quadratic first integral of the 
augmented System (4.1). 

Therefore, every Runge-Kutta method that preserves quadratic first integrals, ls 
asymplectic method. From Theorem IV.2.1 and Theorem IV.2.2 wethus obtain the 
following results. 











Theorem 4.2. 77Pe Gauss collocationm 1ietpods ofsect 下 1.3 a1e sy1ID1ectic. 





Theorem 4.3. 矿 ipe coe 访 cie111sy of a Ru1ge-Kztta 11etpod satis 记 


biai7 十 Da5 一 g Jor QULL 2 7 二 1， .5) (4.2) 











1je1 it13 Sy12DLectic. 





Similarto the situation in Theorem V.2.4, diagonally implicit, symplectic Runge-- 
Kutta methods are composition methods. 


Theorem 4.4. 4 diagomaliy iiPlicil Ra12ge-Kzttia 1Ietpod Satishing 1he 8y1IDPiec- 
1icipy cozdifion (和 .2) a1d 太 天 0058eguivalent to tpPe commposifiom 


MX RM MX 
oo9D5noEbh 
Wje1e @NM sta1ads Jorpe japlicit7midpointrale 


Poof Fori1i = 7 condition (4.2) gives uii = 玉 /2 and, together with aii =0 (or 
1 > 7), implies aij 三 0. This proves the statement， 














The assumption“ 访 和 0”is not restrictive in the sense that for diagonally im- 
plicit Runge-Kutta methods Satisfying (4.2) the internal stages corresponding to 
“一 0”do not infuence the numerical result and can be removed. 


To understand the symplecticity of partitioned Runge-Kutta methods, we write 
the solution of the variational equation as 


万 2 
人 2) 
Then, the Hamiltonian System together with its variational equation (4.1) is a parti- 
tioned system with variables (D,YP) and (q,9). Every component of 


到 7 = (ZP)7 9 一 ( 亚 ?)7ZP 


is of the form (LV.2.5), So that Theorem IV.2.3 and Theorem IV.2.4 yield the fol- 
lowing results. 











Theorem 4.S. 77Pe ZLopatto 1114 -11Bpair iasynplectic method. 
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Theorem 4.6. 矿 1Pe coe 太 cie1y of a Partifioned Rnge-Kx1ia 11etjpod (11.2.2) Sat- 

zs 廊 

ii 十 07Q 旋 一 忆 7 Jor 277 一 5 (4.3) 
太 Jori1 .8s， (4.4) 

1je1 it13 Sy111DLectic. 


矿 1pe Bamailtonian 六 oftpe Jo 万 (D,9) = 了 T(D) 十 VCq) ie it Separaple， 
1je1 1je co11ditio1l (和 4.3) Go1e 111D1ies 1Pe Sy11Dlecticiby of ipe 1U1Ierical jiow 














We have seen in Sect.V.2.2 that within the class of partitioned Runge-Kutta 
methods it is possible to get explicit, symmetric methods for separable Systems 7 二 
j 直 ,之 = 9). Asimilar result holds for symplectic methods. However， as in 
Theorem V.2.6, Such methods are not more general than composition or Splitting 
methods as considered in Sect. II.3. This has first been observed by Okunbor 
Skeel (1992). 


Theorem 4.7. Comsider Ga Pa1tiEio1zed Ru1a8ge-KUiia 1Iethoad Dased o1z 1Wo diago- 
1G11y jpPLicit 7etpods 仁 e, ai 三 0 三 0jori > assunre ai 0ii 三 0Jorall 
2 Gd apDPp1y 这 1 Q Sepa1aple 本 aazltomia1 syste1a Wi 太 万 (D,9q) = 了 (D) 十 D(g). 素 
(和 .3) 1o1ds, 1je1 1pe 171U1terical 7esuit 23 1pe Sa11e dy 11atf opiai11ed jjo1a 1Pe SPLttI118 
111e 太 oa (71.5.0). 

By (11.3.8) SUCcP a 11Ietpod 1 equivalent 1o G co11DOSi1io11 oj sy11Dlectic 尼 U1er 
3teD9. 


Pioof We first notice that the stage values ji = 2) (fori with ui = 0) and 
刀 三 9( 六 ) for with 已 一 0) do not infuence the numerical solution and can be 
removed. This yields a scheme with non-zero pi and 潜 but with possibly non-square 
matrices (aij) and (Qi7). 

Since the method is explicit for separable problems, one of the reduced matrices 
(aij) or (Qij) has a Tow consisting only of zeros. Assume that it is the first row of 
(aij), so that alj 一 0forall7.The symplecticity condition thus implies Qil 一 全 起 
0foraly > landoail 三 大 0for;y >2.Thisthenyields az 夭 0, because 
otherwise the first fwo stages of (Qi7) would be identical and one could be removed. 
By our assumption we get az2 一 0, 0i2 一 也 夭 0for1 >2,andaiz=bfori>3. 
Continuing this procedure we see that the method becomes 


5 加 5 加 5 六 总 岂 
5 5 bo 人 Dj 


where o and 2 are the exact flows corresponding to the Hamiltonians T(p) and 


LU(q), respectively. 














The necessity of the conditions of Theorem 4.3 and Theorem 4.6 for Symplectic 
(partitioned) Runge-Kutta methods will be discussed at the end of this chapter in 
Sect. VI.7.4. 
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A second order differential equation wy = 9(y),，augmented by its variational 
equation, ls again of this Special form. Furthermore, the diagram of Lemma 4.1 
commnutes for Nystrom methods, so that Theorem IV.2.3 yields the following result 
originally obtained by Suris (1988, 1989). 


Theorem 4.8. 矿 ijpe coe 访 cie1tsy of a Nys1rom1a 11etpod (7 2.77) yatisj 
记 ji 人 (一 ci) 人 
oO 一 aij) 一 bj(0Bi 一 aj) Jor 人 了 一 1 .. .3) 


加 e1l 13 8y1IDlectic. 


(4.5) 














VI.4.2 Connection Between Symplectic and Symmetric Methods 


There exist Symmetric methods that are not Symplectic, and there exist Symplectic 
methods that are not Symmetric. For example, the 1apezoidal rule 


几 =2+2(AGo+A) (4.0) 


is Symmetric, but it does not Satisfy the condition (4.2) for symplecticity. In fact， 
this ls true of all Lobatto IIJIA methods (see Example 1.2.2). On the other hand, any 
composition 2 on +72 = 了 Jofsymplectic methods is Symplectic but 
Symmetric only 让 y1 三 72. 

However for (non-partitioned) Runge-Kutta methods and for quadratic Hamil- 
tonians 万 (V) 三 57Cy (C is a symmetric real matrix)，where the corresponding 
System (2.9) ls linear， 

洲 = .1Cy， (4.7) 
we Shall see that both concepts are equivalent. 

A Runge-Kutta method, applied with step size 玉 to alinear System 7 三 ZW, is 


equivalent to 
纪 二 有 AL)yo， (4.8) 


where the rational function 尺 (z) is given by 
RR(z) =1 +z0(T 一 >4)-11 (4.9) 


4= (ai, 虹 = (band 了 = (1 1).The function RR(z) is called 
the siapizty jazzctiom of the method, and it ls familiar to us from the Study of sti 任 
differential equations (See e.g., Hairer 儿 Wanner (1990), Chap.IV.3). 

For the explicit Euler method, the Implicit Euler method and the implicit mid- 
point rule, the stabjlity function 尺 (z) is given by 


1 1 十 2z/2 
1 二 闵 1 一 z/2- 





1 工 十 2， 
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Theorem 4.9， For Raz1ge-KUta 1ietpods 加 he fo11owing state111e111s da1e egUivaLe1 
@ 1je 111etjhod 18 Sy11111et1jic Jor 1iear prople1yg 0 一 了 0 
e@ 1je 1ietpod is 8y11Dlectic Jor prople1ag (和 7 wii sy1t11ietric C; 
@ 1jpe stGDi1ipy jinction satisjies 民 ( 一 2)RR(z) 三 1 Jorall complex > 
Pioof The method Wi = 民 (P7D)yo is symmetric, 让 and only 让 yo = 民 ( 一 六 7)W 
holds for all initial values yo. But this is equivalent to 民 ( 一 j 厂 ) 民 (AD) 三 工 

Since G1 (yo) = 尺 (AD),symplecticity ofthe method for the problem (4.7) is de- 
fined by RPR-IO)7 RUT-IC) = JEForR(z) = P(z)/Q(z) this is equivalent 
to 

P(hJ-ICJTTJP7IC) = QIC) TO 一 IC). (4.10) 

By the symmetry of C, the matrix 取 := ,三 1C satisfies 77.7 = 一 JJ and hence 
also (7)7 = .7( 一 万 for 天 = 0,1,2,... .Consequently, (4.10) is equivalent to 


忆 ( 一 PIC)P(U7IC) =Q@(-77ICJQOPTIC)， 
which is nothing other than 尺 ( 一 PICJR(R7IC) = 工 














VI.S Generating Functions 


...by which the study of the motions of all free systems of attracting or 
Iepelling points is reduced to the search and differentiation of one central 
Telation, Or characteristic function . (W.R. Hamilton 1834) 


Professor Hamilton hat ... das merkwirdige Resultat gefunden, dass . . . 

Sich die Integralglelichungen der Bewegung ... Sammtlich durch die par- 

tiellen Differentialquotienten einer einzigen Function darstellen lassen. 
(C.G.J. Jacobi 1837) 


We enter here the Second heaven of Hamiltonian theory, the realm of partial dif- 
ferential equations and generating functions. The starting point of this theory was 
the discovery of Hamilton that the motion of the System is completely described 
by a“characteristic”function 9, and that 9 is the solution of a partial differential 
equation, now called the 瓦 azzaztltom 一 /acopi dijjerential eqgua1io1， 

It was noticed later, especially by Siegel (See Siegel 久 Moser 1971, 83), that 
Such a function 9 is directly connected to any Symplectic map. It received the name 
Se11e1Q1I118 12c1IO11. 


VI.S.1 了 Existence of Generating Functions 


We now consider a fxed Hamiltonian System and a fxed time interval and denote 
by the column vectors D and dg the ipztal valxesy D1... ,pdandal... ,qdattoofa 
trajectory. The jizazal values at are written as 书 and Q. We thus have a mapping 
(P;,dq) 上 (已 Q) which, as we know, is symplectic on an open set [7. 

The following results are conveniently formulated in the notation of differential 
forms. For a function 已 we denote by qd 有 = Pits (FrEcheb derivative. We denote 
by dd = (dal,... ,dao)z the derivative of the coordinate projection (P, q) F> 9. 
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Theorem S.1. 4 appizg P : (D;9) 一 (已 @) msSy1aplectic 矿 azd oly 矿 1Peye 
exists 1ocaliy a Jiazzctiomn 9(D, q) SucH tpPat 

P7 dg -六 do = d5. (5.1) 
7T7Jisy maea1sy 1pat PT dQ@ -DT do iatotaldjerentigL 


Proojf We split the Jacobian of po into the natural 2 x 2 block matrix 


SS 


JInserting this into (2.0) and multiplying out shows that the three conditions 
PIGQo=QIP， P Qu 一 T=Q7 Pu @I 忆 =PQ。 (5.2) 


are equivalent to SYymplecticity. We now insert dQ = padp 十 Quvada into the left- 
hand side of ($.1) and obtain 


dp QI 四 adD 
(0 PRO5e | | 四 [| 9 


To apply the Integrability Lemma 2.7,， we just have to verify the Symmetry of the 
Jacoblan of the coefficient vector， 


@ 乃 | 820@， 本 
人 QIL 户 2 ”D(p,9g)2 2 





Since the Hessians of Qi are Symmetric anyway, it is immediately clear that the 
Symmetry of the matrix (3.3) is equlivalent to the Symplecticity conditions (3.2). 














Reconstruction of the Symplectic Map from 9. UP to now we have considered 
all functions as depending on Dp and 9. The essential idea now is to introduce new 
coordinates; namely ($.1) suggests using z = (‰,Q) instead ofy = (2,q). This is a 
well-defined local change of coordinates y = 岁 (z) 让 P can be expressed in terms of 
the coordinates (q,@), which is possible by the implicit function theorem 让 党 1S 
invertible. Abusing our notation we again write 9(q,@) forthetransformed function 
9(U(2)). Then, by comparing the coefficients of ds9 = dd 十 2 dQ 
with (5.1), we arrive at 





D9 D9 
卫 = 一 (qOQ)， 三 一 一 (qqQ). 和 .4 
天 (和 Q@)， 了 7 页 (4Q9) (5.4) 
If the transformation (2,dq) 上 (已 @) is symplectic, then it can be reconstructed 
位 om the scalar function 9(q,Q@) by therelations (5$.4). By Theorem 5.1 the converse 


3 On the right-hand side we should have put the gradient VoS = (895/6Q@) We shall 
not make this distinction between Iow and column vectors when there is no danger of 
confusion. 
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is also true: azy SUHicientpy s1ioo 态 aad 1o1adege1ierate jzaction 9(q,Q@)“gene 广 
ales”Via (3. 和 GasSy1iplectic apPi18g (D;9q) 上 ( 忆 Q).This gives us apowerful tool 
for creating Symplectic methods. 


Mixed-Variable Generating Functions. Another often useful choice of coordi- 
nates for generating Symplectic maps are the mixed variables 1 d). For any Con- 
tinuously differentiable function 9 ( 乙 9) we clearly have d5 =- 如 汪 二 d 记 于 .5 32 7 d0. On 
the other hand, since d(P7Q@) = P7dQ@ +Q7IdP,the symplecticity 0 (3.1) 
can berewritten as QTdP+p7add= d(Q1P-5S) forsome function 9. Ittherefore 
follows from Theorem 5.1 that the equations 

69 05 


G = 5 人 9)， 了 一 画 岂 d) (3.5) 


define (locally) a symplectic map (Po) > (已 @) 让 ? 四 /0Paa is invertible. 


Example S.2. Let Q = X(d) be a change of position coordinates. With the gener- 
ating function S(P, q) = P7X(q) we obtain via (5.5) an extension to a symplectic 
mapping (pp, g) > (已 Q@).The conjugate variables are thus relatedbyz = X(9) 氏 书 . 


Mappings Close to the Identity. We are mainly interested in the Situation where 
the mapping (2,dq) 呈 (已 @) is closeto the identity. In this case, the choices (P,Q) 
or (P9) or ((P+D)/2,(Q 二 9)/2) of independent variables are convenient and 
lead to the following characterizations. 


Lemma S$.3. Zel (D;, 9g) 一 (已 @) pe as1ioo 纺 11ra1s1omaatiom close io 1pe identity 
1118 8Sy1Dlectic 矿 a1ad oly 矿 o1me of 1pe jilowiag co1aditions olds 1ocal1y: 


e。 QLdP+pdo=dPI9g+51) Jorsome jnction S1(P 9q); 
e Pde@+dodp=dpIQ@ -52) jorsome jnction 5S2(D,Q); 
。(@ 一 qd(P+ 凡 一 (下 一 站 QQ 二 gg) 一 209? 

Jpr some jinction 93(( 忆 十 D)/2,(@ 十 q)/2)， 





Proof The first characterization follows from the discusslon before formula (3.9) 让 
we put 91 such that P70 十 91 一 = 了 = Q@7L 忆 -9. Forthe second characterization we 
use d(pz9g) = pido+d7adp and the same arguments as before. The last one follows 
from the fact that (5.D) is eduivalentto (Q -do)7d(P+I--(P-D7ad(Q+o9) = 
d((P+D7(Q-9g) 一 29). 


The generating functions 91, 92, and 33 have been chosen such that we obtain 
the identity mapping when they are replaced with zero. Comparing the coefficient 
functions of da and dP in the first characterization of Lemma $.3, we obtain 














汪 
Q@ 三 9 十 


几 55 


DLP 9) (5.0) 
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Whatever the scalar function 9S1(P,dq) is, the relation (5$.6) defines a symplectic 
transformation (p, g) > (PQ). For 591(P,9) := / 厅 (Pg) we recognize the sym- 
plectic Euler method (1.1.9). This is an elegant proof of the Symplecticity of this 
method. The second characterization leads to the adjoint of the Symplectic Euler 
method. 

The third characterization of Lemma 5.3 can be written as 


忆 = 2p 一 0653((P+D1/2(Q+9/2)， 
Q@ = 4+653((P+D)/2(Q+9/2)， 


which, for 53 = /万 ,is nothing other than the implicit midpoint rule (L.1.7) applied 
to a Hamiltonian system. We have used the notation O1 and 02 for the derivative with 
resSpect to the first and Second argument, rfespectively. The System (3.7) can also be 
written in compact form as 








(3.7) 


了 =y+JIVS3(( 并 十 力 /2)， (5.8) 


where 节 = (PQ),y = (pq 53(w) = 93(0wu) with ww = (wu and yis the 
matrix of (2.3). 


VI.S.2 Generating Function for Symplectic Runge- 天 utta 
Methods 


We have just seen that all symplectic transformations can be written in terms of gen- 
erating functions. What are these generating functions for Symplectic Runge-Kutta 
methodsy The following result, proved by Lasagni in an unpublished manuscript 
(with the Same title as the note Lasagni (1988)), gives an alternative proof for The- 
orem 4.3. 


Theorem 5S$.4， SUPPose 太 Pat 
Diai 十 07Q7 一 忆 7 Jor CU 2 7 (3.9) 
(See 7Tjeoe11 4.3). 71e1 1je Ronge-Kutta 111etpod 
书 =D 一 疡 》 广 BCPQi)， 玉 =2p 一 疡 》 aiBo(PQ@)， 
JJ=1 


2 一 1 


S 8 (3.10) 
Q@=4+ 六 >》 方 有 (PQi)， Qi = 二 >》 aijB( 记 ,GQj) 


1 一 1 JJ 一 1 


Ca1 De W1itte1 ay (3.0) mwit 太 


SPd 阅 = 六 》 5 末 (PGi 一 忆 》 baj 囊 (8 丽 (PQj) (05.1D) 
7 


熙 站 二 寺 
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Proof We first differentiate S1(P, q,]) with respect to q. Using the abbreviations 
互 中 三 万 (Pei 本 加 = 囊 (PoQi), .weobtain 


呈 号 加】 一 于 (人 = "于 贡品) 
+ 六 5 有 (+ oj 基肥 四 
外 了 
With 人 
5 “全 75 


(this is obtained by differentiating the first relation of (3.10)), Leibniz rule 


D ， TO TO 
现 ( 辐 瑟 国 ) 一 砂 辕 讽 瑟 团 睹 厂 辐 页 画 轩 
and the condition ($.9) therefore yield the first relation of 


The second relation is proved in the Same way. This Shows that the Runge-Kutta 
formulas ($.10) are equlivalent to (3.0). 














It is interesting to note that, whereas Lemma 3.3 guarantees the 1ocal! existence 
of a generating function 91, the explicit formula ($.11) shows that for Runge-Kutta 
methods this generating function ls 81opaLiy deftined. This means that it 1S well- 
defined in the same region where the Hamiltonian 万 (2, 9q) is defined. 


Theorem S.S. 4 Pa71i1io1ed Ru128e-KU1ta 11etjpod (11.2.2) Salisjhi18g 1Pe yy11Dlec- 
1zcity CO1G111O18 (4.3) G11G (4.4) 713 equivaleat 1o (5.0) Wi 太 


ST( 己 0 站 = 六 》 有 (PP Qi 一 忆 光 5 有 (Po QT7B(CPG 让 ， 
[2 汪汪 


矿 1pe Baailtonian 六 oftpe jorma 万 (D,9) = 了 T(D) 十 DT(q) ie ii Separaple， 
1 太 e1l 1 太 e co1difio1 (4.3) Go11e 111DPliesy 加 at 加 e 11etpod isy ofipe jor1a (5.0) Wi1 


S1(P 9 Ja (olol )+2T(CP)) - 悦 和 0 站 


站 











Prooj This is a straightforward extension of the proof of the previous theorem. 
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VI.S.3 TIhe Hamilton-Jacobi Partial Differential Equation 


We now return to the above construction of 
9 for a symplectic transformation (P,q) 哺 > 
(已 @) (see Theorem 5.1). This time,，how- 
ever，we imagine the point 己 ( 攻 ,QU to 
move in the fow of the Hamiltonian System 
(1.7). We wish to determine a Smooth generT- 
ating function 9(q,Q@, 妇 ,now also depending 
on 上 which generates via (3.4) the Symplectic 
map (2;,9g) 上 (P(,Q() of the exactJiowm 
of the Hamiltonian System. 

In accordance with equation (S$.4) we 
have to Satisfy 














O59 
(5.12) 
Differentiating the Second relation with respect to 上 yields 
225 0 
SR (GO 汶 十 507 (4 Q@O 蚊 .GjO (5.13) 
D29 吧 D2 9 万 
= (QQ 人, 十 2 50 (4 Q@ 扩 : 现 (P(,QO) (5.14) 


where we have inserted the Second equation of (1.7) for Q 刻 Then, using the chain 
rule, this equation Simplifies to 


DO /09 ( 0O9 09 
00 0 OoQ1 “0oQv 


This motivates the following surprisingly simple relation. 





二 一 0. (5.15) 


Theorem S.6. 矿 9(qQ@ 加入 as1oo 太 solution ortpe Partial derential equation 


D9 [ DO9 
of  \ 9 09。 
Wi 贡 GUO vaG1ues Sa1ighizg 努 (q90) 十 劳 (490) 一 0，a1d 矿 1pe 1Ia1rix 
(型 东 ) 1 ivertiple 1hen 1pe map (29) 王 (Pb@O) dejired Dy (5.712) 妆 
1jpe Jow pt(p;,d) oftpe Damailtomia1 syste1a (7.7) 

EGguatio1 (3.76) 88 caied 11e “Faizlio1 一 /acopi pa1tial di1eretial eqgUatiom 





0 一 0 (5.10) 


4 Carl Gustav Jacob Jacobi, born: 10 December 1804 in Potsdam (Cnear Berlin), died: 18 
February 1851 in Berlin. 
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Prooj The invertibility of the matrix (5 六) and the implicit function theorem 
了 


imply that the mapping (2,q) 一 (P(b,QO) is well-defined by (5.12),， and, by 
differentiation, that ($.13) ls true as wel]. 

Since，by hypothesis，95(q,Q@ ,bis a solution of (5.16), the equations (3S.15) 
and hence also (3.14) are Satisfied. Subtracting (3.13) and (3.14), and once again 
using the invertibility of the matrix 人 隐 坟 7 ，We See that neceSsSarily Q@ 人 
机 (P(,QO).This proves the validity of the second equation of the Hamiltonian 
System (1.7). 

The first equation of (1.7) ls obtained as follows: differentiate the first relation 
of (5.12) with respect to 上 and 了 Hamilton-Jacobi equation (3.16) with respect 
to Qi, then eliminate the term 访 已 Using @ 人 ( = 机 (PPO,QO),this leads in 
a straightforward way to 书 ( 伯 = 一 瓦 xi(P(,QO). The condition on the initial 
values of 9 ensures that ( 忆 (0),Q@(0)) = (P, 9). 




















In the hands of Jacobi (1842), this equation turned into a powerful tool for the 
analytic integration of many difticult problems. One has, in fact, to find a solution 
of (9.16) which contains suffticiently many parameters. This ls often possible with 
the method of separation of variables. An example is presented in Exercise 11. 


Hamilton-Jacobi Equation for S1,，9S2, and S3. We now express the Hamilton- 
Jacobi differential equation in the coordinates used in Lemma 5.3. In these coordi- 
nates it is also possible to prescribe initial values for 9 att = 0. 

From the proof of Lemma 53.3 we know that the generating functions in the 
variables (qd,Q@) and ( 书 ,a) arerelated by 


SI(Pdq 昌 = PIT(Q-9g) -5S(00Q,D. (5.17) 


We conslider 己 , qtas independent variables, and we differentiate this relation with 
respect to 上 .Using the first relation of (3.12) this gives 


D51 DTOQ 05 0Q 09 .09 
到 (0= 了 了 赤 一 了 天 (4， 六 南 一 率 (9， 六 = 一 责 (9， 
Differentiating (5.17) with respect to 书 yields 
D51 局 7 六 OQ OO 
an 溃 -区 20 芝 =Q@ 一 (4. 


JInserting 编 一 王 andQ = 三 1 + 足 * into the Hamilton_Jacobi equation (3.16) we 
are ledto the equation of the 0 theorem. 


Theorem 5$.7. 矿 S1(P9iaso1ution oftpe partial djferential eguatiomn 


0 间 员 


加 el tpe 1aiapping (D,9) 证 (P(b， Q@())， dejiped py (5.6) 1 1pe exacf jiow or tpe 
已 Ga11zzliomia1z yyste110 (7.]) 
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Proof Whenever the mapping (p,9) 一 ( 忆 ( 雪 ,QI) can be written as (5.12) with 
a function 9(q,Q, 加 ,and when the invertibility assumption of Theorem 5.6 holds， 
the proof is done by the above calculations. Since Our mapping, for 上 三 0, reduces 
to the identity and cannot be written as (3.12), we give a direct prootf. 

Let S1(P, db be given by the Hamilton-Jacobi equation (5.18), and assume 
that (2,9) 忆 (PPG) = (PW,QO) is the transformation given by (5.6). Differen- 
tiation of the first relation of ($.6) with respect to time ti and using ($.18) yieldss 


D25- 


8251 8 
一 可 0 


D251 ， 


Differentiation of the second relation of (3.0) gives 


. D251 D251 . 
D 万 D251 D 万 . 
加 证 (9 二 5 思 可 二 
Consequently, 书 = 二 5 Q) and Q = 强 (PG9)， so that (PP 人 ,QO) = 














oz(D;,dq) is the exact flow of the Hamiltonian system. 


Writing the Hamilton-Jacobi differential equation in the variables ( 尸 十 D)/2， 
(@Q +a)/2 gives the following formula. 


Theorem 5S.8. 4sszmae 1Pat 93(uv 和 aso1ution oF 


3 3 3 
人 如 (一 二 下 
人 2 0 


六 (wz 胃 】 (5.19) 


Wi ipitial condifion 93(uwU;0) = 0. 7T1em 1Pe exact jiom pi(D;9g) of zhe am 让 
1O11G1 SySte111 (7.7Z) Satisjies 加 e yyste111 (5.1 


Pioof As in the proof of Theorem 5.7, one considers the transformation (P,q) 哺 
(P(D,QO) defined by (5.7), and then checks by differentiation that ( 忆 ( 坟 ,Q OO) 
is a solution of the Hamiltonian System (1.7). 














Writing ww = (wuv) and using the matrix .J of (2.3), the Hamilton-Jacobi equa- 
tion (9.19) can also be written as 


2 = 三 (w+ JIVSau, 人 )， Sa(u;0) = 0， (5.20) 

The solution of (5.20) is anti-sSymmetric in 万 1.e.， 
S3(w, 一 四 = 一 93(uw 人 加 . (3.21) 
5 Due to an inconsistent notation of the partial derivatives 尖 ， as column Or TOW Vec- 


tors, this formula may be difficult to read. Use indices instead of matrices in order to check 
its coOrrectneSS. 
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This can be seen as follows: let pi(w) be the exact flow of the Hamiltonian system 
0 = .JIV 万 (四 Because of (3.8), 93(w,b is defined by 


ot(uw) -= .JIVS3((os(w) 二 WO)/2, 刘 . 
Replacing 上 with 一 : and then w with wx(w) we get from p_i(pr( 鸭 ) = that 
一 9i(u) 三 JJ VS3((w 十 oz(w))/2， 一 四 . 


Hence 5S5(uw, 加 and 一 53(w, 一 加 are generating functions of the same Symplectic 
transformation. Since generating functions are unique up to an additive constant 
(because d9 = 0 implies 9 = Cozsi, the anti-symmetry (3.21) follows from the 
initial condition 93(w, 0) = 0. 


VI.S.4 Methods Based on Generating Functions 


To construct Symplectic numerical methods of high order, Feng Kang (1986), Feng 
Kang, Wu, Qin & Wang (1989) and Channell 儿 Scovel (1990) proposed computing 
an approximate Solution of the Hamilton-Jacobi equation. For this one inserts the 
anSatZ 

SI(Pd =tGi( Pd+iGaCPq)+1GsCPa) 十 .… 


into (3.18), and compares like powers of 上 . This yields 
CI1OP,9) 一 五 ( 乙 ,9)， 


9 


Go2(P9) = 慌 了 开 页 





1 7 六 让 /8 站 本 02 厅 80 站 让/0S 
人 亲 (让 ( 史 ) 卡 囊 古 末 古 届 7 1 )g 


Hwe use the truncated series 
SP = PhGIPd+12Go(PO+: +1G Pdg) (3.22) 


and insert it into (3.6), the transformation (P, 9q) 一 ( 己 Q@) defines asymplectic one- 
step method of order 7. 9Symplecticity follows at once from Lemma 3.3 and order 7 
is a consequence of the fact that the truncation of 91(P, q) introduces aperturbation 
of size O(Pr+l) in ($.18). We remark that forr > 2 the methods obtained require 
the computation of higher derivatives of 万 (p,q),， and for separable Hamiltonians 
万 (pp,9q) = 了 (D) 十 V(q) they are no longer explicit (compared to the Symplectic 
Euler method (3.1)). 
The same approach applied to the third characterization of Lemma 5.3 yields 


S3(u, 四 = PPGi(o) 二 MGa() 十 .十 12IG2 (wo)， 


Where G1(w) = 万 (由 )， 
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1 
Ga(w) = 页 VE(o(7IVYE(o, YE(w)， 
and further Cj(w) can be obtained by comparing like powers of 尹 in (3$.20). In this 
way We get Symplectic methods of order 27. Since 93(uw, 九 ) has an expansion in 
odd powers of A, the resulting method is symmetric. 


The Approach of Miesbach & Pesch. With the aim of avoiding higher derivatives 
of the Hamiltonian in the numerical method， Miesbach 贸 Pesch (1992) propose 
considering generating functions of the form 


S3a(uw, 有 站 一 六 已 吾 (u 下 per7mIVE(w))， (5.23) 


we 


and to determine the free parameters bi ci in Such a way that the function of (5.23) 
agrees with the solution of the Hamilton-Jacobiequation (3.20) up to a certain order. 
Forbs 三 iandcs+l ;= 一 cithis function satisfies 3953(w, 一 j) 一 一 S3( 万)， 
Sothatthe resulting method is symmetric. A straighttorward computation shows that 
it yields a method of order 4 让 


S 豆 1 


For s = 3, these equations are fulfilled for 1 = 0%3 一 5/18, 0 = 4/9, cl = 一 c3 三 
V15/10, and cz = 0. Since the function 33 of ($.23) has to be inserted into (5.20)， 
these methods still need Second derivatives of the Hamiltonian. 


VI.6 Variational Integrators 


Athird approach to Symplectic integrators comes from using discretized Verslions 
of Hamilton's principle，which determines the equations of motion from a varia- 
tional problem. This route has been taken by Suris (1990)， MacKay (1992) and 
in a series of papers by Marsden and coauthors，see the review by Marsden 妇 
West (2001) and references therein. Basic theoretical properties were formulated 
by Maeda (1980,1982) and Veselov (1988,1991) in anon-numerical context. 


VI.6.1 Hamilton's Principle 


Ours, according to Leibniz, is the best of all possible worlds, and the laws 
of nature can therefore be described in terms of extremal principles. 
(C.L. Siegel 儿 J.K. Moser 1971, p. 1T) 


Man scheint dies Princip fruher .… unbemerkt gelassen zu haben. 
瑟 amzilto ist der erste, der von diesem Princip ausgegangen ist. 
(C.G.J. Jacobi 1842, p. 58) 
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Hamilton gave an improved mathematical formulation of a Principle 
which was well established by the fundamental investigations of Euler 
and Lagrange; the integration process employed by him was likewise 
known to Lagrange. The name“Hamilton's principle”", coined by Jacobi， 
Was not adopted by the scientists of the last century. It came into use， 
however, through the textbooks of more recent date. 

(C. Lanczos 1949, p. 114) 


Lagrange's equations of motion (1.4) can be viewed as the Euler-Lagrange equa- 
tions for the variational problem of extremizing the actiozz zteg1al 


有 的 吕 / ED) 才 (6 


to 


among all curves d(t that connect two given points do and al1: 


d(to) 三 ao，d() =91 (6.2) 


In facb assuming d(b to be extremal and considering a variation qd 人 (加 十 sod 人 加 
With the same end-points, ie.， with bg9(to) = 0d( 三 ) = 0， gives, using a partial 
integration， 


d /67 7 2 797 dpr 
o= 攻 srci0= 人 ( 吉 20+ 贡 罗 w= 上 人 (二 二 二 员 ) 交 着 


which leads to (1.4). The principle that the motion extremilzes the action integral ls 
known as 瑟 a1zazz11om sy PFCciPLe. 

We now consider the action integral as a function of (ao,qij, for the solution 
dt of the Euler-Lagrange equations (1.4) with these boundary values (this exists 
uniquely locally at least 过 ao, gl are Sufficiently close)， 


这 / 全 (63) 


to 


The partial derivative of 9 with respect to qo is, again using partial integration， 


5 arp5 8 Dr 0 
= 是 | 作 囊 全 ) 寺 








oo od pq 66a 
Oo16g 176 dr ， 6 . 
-有 直人 ( 击 一 吾 二) 站 4 -再 @ 训 





with do = d(to), where the last equality follows from (1.4) and (6.2). In view of the 
definition (1.5) of the conjugate momenta, pp = 907/6d,the lastterm is simply 一 20. 
Computing 929/6al = pl in the same way, we thus obtain for the differential of 9 
D9 DO9 
d9 = 坟 -dql 十 二 ddqo 三 Dldal 一 podao (6.4) 
oa oao 
Which is the basic formula for Symplecticity generating functions (See (3.1) above)， 
obtained here by working with the Lagrangian formalism . 
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VI.6.2 Discretization of Hamilton's Principle 


Discrete-time versions of Hamilton's principle are of mathematical interest in their 
own right, see Maeda (1980,1982), Veselov (1991) and references therein. Here they 
are considered with the aim of deriving or understanding numerical approximation 
Schemes. The discretized Hamilton principle consists of extremizing, for given do0 
and gNv, the Sum 


Sn({dn]0) 工 P(dn n+1) (0.3) 


We think of the discrele Lagramgia1 ZL as an approximation 


妃 相 
站 汪 才 人 六 全 (6.0) 
志 


也 


where d(b is the solution of the Euler-Lagrange equations (1.4) with boundary 
values d( 刀 ) = qd(t 如 1) = dnf+l. 开 equality holds in (6.6), then it is clear 
from the continuous Hamilton principle that the exact solution values {a(t)} of 
the Euler-Lagrange equations (1.4) extremize the action Sum oj. Before we turn 
to concrete examples of approximations 万 ，we continue with the general theory 
which is analogous to the continuous case. 

The requirement 90sj/6d， = 0 for an extremum yields the discrete Eule 盖 
Za810118e eGIUG1IO118 


D7 DO7 

一 人 \ 人 dm 一 1 dm 一 \ 人 dm dm ss 0 0.7 

5 全 1 gn) 十 (ngn+1) (6.7) 
for7m = 1 ...,N 一 1 where the partial derivatives refer to 7 三 Zn(zZ,V). This 
gives a three-term difference Scheme for determining q1,... ,qqN 一 1. 


We now Set 


)(d0， dN) 汪 也 六 dm， dan+1) 
Where {dn} is a solution of the discrete Euler-Lagrange equations (6.7) with the 
boundary values do and gw. With (6.7) the partial derivatives reduce to 


83， _ mn 
aoo ”or 


90 “6 


Bow 二 机 (CCw-b4w) 2 


(ao, 01)， 


We introduce the discrete 1io11e1ta via a discrete Legendre transformation， 


_ 01 
pr 一 一 区 人 (won ， (6.8) 


The above formula and (6.7) for 史 = N then yield 


do =DPv addN 一 Do da0. (0.9) 
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开 (6.8) defines a bijection between pn and qn+l for given qn， then we obtain a 
one-step method 瑟 ，: (pn, dan) 上 (phi)qn+1) by composing the inverse dis- 
crete Legendre transform, a step with the discrete Euler-Lagrange equations, and 
the discrete Legendre transformation as Shown in the diagram: 


(6.7) 
(dan， dn+1) 二 (qn+1， dn+2) 
(6.8) | | (6.8) 
(pn dn) (Dn+1， dn+1) 


The method is Symplectic by (6.9) and Theorem 5.1. A Short-cut in the computation 
is obtained by noting that (6.7) and (6.8) (for 即 十 1instead ofm) imnply 


OF 
pn+1 一 刷 (oo ， (6.10) 


which yields the Scheme 


(6.8) (6.10) 
(pn dn) 二 人 (qu dgn+1) 一 (pn+l; dgn+1) . 


Let us Summarize these considerations, whlch can be found in Maeda (1980), Suris 
(1990), Veselov (1991) and MacKay (1992). 


Theorem 06.1. 77e discrete 有 Eaz1azltozz P1rinciPpie jor (0.5) givesy 1pe discrete Eu1e/ 一 
Za81a118e eqguUa1io18 (0.7) a11d 1Pe sy11Dlectic 1Ietpod 


OF DO 
pr 一 一 有 (gg Do = 一 全 (ggnHi)， (6.11) 
0 ODV 


These formulas also Show that Zn is a generating function (3.4) for the Sym- 
plectic map (pn an) 上 (pn+l;gn+1). Conversely, Since every Symplectic method 
has a generating function (3.4), it can be interpreted as resulting from Hamilton 's 
Principle with the generating function (5.4) as the discrete Lagranglian. The classes 
of symplectic integrators and variational integrators are therefore identical. 

We now turn to Simple examples of variational integrators obtained by choosing 
adiscrete Lagrangian ZWwith (0.0). 





Example 6.2 (MacKay 1992). Choose Zp(an,dqn+l) by approximating qd( 切 of 
(6.6) as the linear interpolant of qn and qn+1 and approximating the integral by 
the trapezoidal rule. This gives 





几 也 由 几 也 2 
有 (gu goH) = 一 和 十 了 (to (612) 


and hence the symplectic scheme, with w+1/2 三 (n+1l 一 an) 人 /for brevity， 
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1 07 1 07 灵 O7 
DPn 一 7 机 (dn, Vn+17/2) 十 7 页 (dan+L Vn+1/2) 一 了 5 二 (9n, UnT1/2) 
1 07 1 0 灵 O7 
DPn+1 一 7 页 (qdn, Vn+172) 5 聘 (n+ n+l/3) 十 了 5 二 一 (an+l)VUn+1/2) . 


For amechanical Lagrangian 了 (qd, 9) 一 二 9 ad 一 V(a) this reduces to the Stormer 
Verlet method 


1 
Jun+I172 二 Dn 十 Ar 


dm 十 1 Gan 十 几 Un+112 





工 
Dn+l 三 Mon+l/2 十 5P4n+i 


where 有 = --VU(qw). In this case, the discrete Euler-Lagrange equations (6.7) 
become the familiar second-difference formula MT(an+l 一 20 十 an 1) 三 万 2 严 ， 


上 xample 0.3 (Yendlandt & Marsden 1997). Approximating the integral in (6.0) 
instead by the midpoint rule gives 





7 填 允 也 ”dm 
Tp(gngnH1) 一 PE( 革 人 这 ) (6.13) 


This yields the Symplectic scheme, with the abbreviations qn+1 /2 二 (qun+1 十 an)/2 
and Vn 二 1/2 二 (dan+1 一 dn)/1， 














O7 灵 O7 

DPn 一 丽 (n+U2 mt/2) 一 255 (qdn+1/2，Vn1/2) 
O7 灵 O7 

DPmn+1 一 丽 (n+U2 ntL/2) 二 255 (qdn+1/2， Un+1/2) . 


For Z(a,o) = 二 0 4 一 V(a) this becomes the implicit midpoint rule 


1 
AUn+l72 二 Dmn 下 本 /An+172 





dn+l 二 4dn 下 jn 二 112 





1 
Dn+l 三 Mun+l/2 十 5An+1/2 
with Pi 一 一 VIDEO 二 gm) 


VI.6.3 Symplectic Partitioned Runge- 下 utta Methods Revisited 


To obtain higher-order variational integrators, Marsden 久 West (2001) consider the 
discrete Lagrangian 


An(do, 91) 422ozlv ,aci) (6.14) 
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Where (区 is the polynomial of degree s with v(0) = go, (PP) 三 91 which ex- 
tremizes the right-hand slide. They then Show that the corresponding variational in- 
tegrator can be realized as apartitioned Runge-Kutta method. We here conslider the 
slightly more general case 


古 (go0) = 六 》 biFE(Qi Qi) (6.15) 
i1 
where 5 
Ci 三 90 十 凡 》， ai7GO 
J=1 


and the Q; are chosen to extremize the above Sum under the constraint 


和 一 0 十 刀 》 Qi 
一 1 
We assume that all the w are non-zero and that their sum equals 1. Note that (6.14) 
is the Special case of (6.15) where the aij and pi are integrals (IIL.1.10) of Lagrange 
polynomials as for collocation methods. 


With a Lagrange multiplier 入 = (Xi,...,Xz) forthe constraint the extremality 
conditions obtained by differentiating (6.15) with respect to @j for 7 三 1 .5， 
read 

二 ， 87 97 : 
55Q5 Ci)Paij 十 六 页 (95， GJ) 三 几 和 ， 
With the notation 
:01 。 D7 ， 
五 三 GO) 万 = 有 (ii 0.10 
元 (rnO 起 (@O0 (6.10) 
this simplifies to 
i1 


The symplectic method of Theorem 6.1 now becomes 


7 
0 一 一 责 (9 
SR 0 
三 0 人 
一 3 
一 


In the last equality we use (6.17) and 几 2 56Q; /6ao = 一 六 which follows from 
differentiating the constraint. In the same way we obtain 
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O7 
内 二 责 (o0) 三 入 . 


Putting these formulas together, we see that (D1, 91) result from applying a parti- 
tionedRunge-Kutta method to the Lagrange equations (1.4) written as a differential- 
algebraic System 





5 丈 (用 ，p= 贡 @9， (618) 
That ls 
1 一 Do 二 六》 5 户 ， 4 = 二 55Qi， 
二 (6.19) 
玉 =po+7>》 05 方 ， Qi = 0 十 姑 iaz5Gj， 
| 


with ij 三世 一 aii/b so that the symplecticity condition (4.3) is fulftilled, and 
with 万 ,Qi 已， Qi; related by (60.16). Since equations (6.10) are of the Same form as 
(6.18), the proof of Theorem 1.3 Shows that they are equivalent to 


0 万 。 0 万 

玫 生 一 二 一人 要 天 和 (人 区 0.20 

区 (0 ，d= 机 (0 (620) 

with the Hamiltonian 刀 = p70 一 Zoo) of (1.6). We have thus proved the follow- 
ing, which is similar in Spirit to a result of Suris (1990). 


Theorem 6.4.， 77Pe yariatiomal ntegrator miil 1pe discrete ZLagra1giamn (6.715) 1 
eqgUiVGLe1t 1o 1je Sy11Dlectic PaFtIIO1eG Ru1128e-KU1ta 11ethod (6.79) (6.20) GDPPlied 
1 妨 e 矶 Ga111ilfo11ia11 8Syste1a Wi8 太 加 e 瓦 az1atzlio1ia11 (7.0). 














In particular as noted by Marsden 儿 West (2001), choosing Gaussian quadrature 
in (0.14) gives the Gauss collocation method applied to the Hamiltonian System， 
while Lobatto quadrature gives the Lobatto IJIA - JIB pair. 


VI.6.4 Noether”s Theorem 


... enthalt Satz Ialle in Mechanik u.S.w. bekannten Satze Uber erste In- 
tegrale. (上 . Noether 1918) 


We now return to the Subject of Chap.IV, ie., the existence of first integrals，but 
here in the context of Hamiltonian Systems. 上 . Noether found the Surprising result 
that continuous yy11111etyriesy in the Lagrangian lead to Such first integrals. We give 让 
the following a version of her “Satz [, Speciallized to ou needs, with a particularly 
Short prootf. 


Theorem 6.S (Noether 1918). Comsider a syste1a wiiU Bamailionian 万 (P,9) amd 
ZaS1a118ia1 了 (qd). Suppose {gs :5 E 月 ja one-paraieter 81otpD of tra1sjo 广 
1IQ11O118 (gs o gr 一 9gs+r) WPich 1eayves 加 e ZLa81G1181G1 10VG1IG1L 
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(gs(9),gs(g)g) 一 世 0 人 Joralsandal (gg (6.21) 
Zetf a(q) = (qd/ds)j|。=ogs(dq) pe dejizaed as te vectorjield wii jiow gs(q). Ten 
TCD,g) = 三 a(g) (6.22) 
18 Q Jst iteg1al of tpe 瑟 a11ailtomia12 8SySte110. 
了 xample 6.6. Let Cbe amatrix Lie group with Lie algebra g (See Sect.IV.6). Sup- 
pose F(Q@aQo) = ZooforalQ es G.Then p749is a first integral for every 


4eg.(Take 9qs(dq) =exp(s4)d.) For example, G = 9O(m) yields conservation of 
angular momentum. 


We prove Theorem 6.5 by using the discrete analogue, which reads as follows. 


Theorem 6.7. SuPPpose 加 e o1e-pa1a1aeter 810UD of tra1ts1ormaiatiolsg {gs : sS E 及 } 
1eaves 如 e discyele La8g1a118ia1z 了 jn(d0, qi1) DovG7ia1t: 


Zpn(gs(q0),gs(q1)) = Th(qo,91) Jora1lsand al (9qo,91). (60.23) 
7T1en (6.22) 18 0 Jst Integral ortpe metpod (6.77) Le， DZ 1a(qn+l) 一 DLa(dn). 
Prooj_ Differentiating (0.23) with respect to s gives 


d DO7 DO7 
0= 丙 ls(o),o()= 页 (oalo)+ 机 (dao 














By (6.11) this becomes 0 = -2d a(dao) + 2ta(ai). 


Theorem 6.3 now follows by choosing 六 三 9 of (6.3) and noting (6.4) and 


stoat) = 人 ze,6) 


to 


=- 三:(oGo) Sa))u=s(oGtojolto)) 


to 


Theorem 6.7 has the appearance of giving a rich Source of first integrals for Sym- 
plectic methods. However it must be noted that, unlike the case of the exact fow 
map in the above formula, the invariance (0.21) of the Lagrangian 了 does not in 
general imply the invariance (0.23) of the discrete Lagrangian ZP of the numerical 
method. A noteworthy exception arises for linear transformations 9gs。 as in Exam- 
ple 6.6, for which Theorem 6.7 yields the conservation of quadratic first integrals 
D7L4d, such as angular momentum, by symplectic partitioned Runge-Kutta methods 
一 aproperty wealready know from Theorem IV.2.4. For Hamiltonian Systems with 
an associated Lagrangian 了 (qd, 9) 一 二 7 AM 4 一 UV(a,allfirst integrals originating 
from Noether's Theorem are quadratic (See Exercise 13). 
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VI.7 Characterization of Symplectic Methods 


Up to now in this chapter, we have presented sufficient conditions for the Symplec- 
ticity of numerical integrators (usually in terms of certain coeffcients). Here，we 
will prove 1ecessa1y conditions for Symplecticity,， li.e.，answer the question as to 
which methods are of Symplectic. It will turn out that the sufticient conditions of 
Sect.VI4, under an irreducibility condition on the method, are also necesSary. The 
main tool is the Taylor series expansion of the numerical How yo 呈 2z(yo), which 
we asSume to be a B-series (Or a P-series). 


VI.7.1 B-Series Methods Conserving Quadratic First Integrals 


The numerical solution of a Runge-Kutta method (LU.1.4) can be written as 3a 
B-series 辣 
几 肝 
邦 =B(owoyo)=% 二 >》， 5 “JR(o) (0.D) 
TE 人 


with coefficients w(7) given by 
alT)=》 bigi(r) for 7ET (7.2) 
放 1 


(See (LUI.1.16) and Sect. JU.1.2). Our aim ls to express the sufficient condition for 
the exact conservation of quadratic first integrals (which is the Same as for Symplec- 
ticity) in terms of the coefficients w(7). For this we multiply (4.2) by gi(z gj) 
(where = [wu ,um]andv = [ouUlaretrees in7) and wesum overall; 
and7. Using (HUUI.1.13) and the recursion (JU.1.15) this yields 


> oswoa + oo 区 bgi(w】 代 55jgi(o))， 


where we have used the Butcher product (See, e.g., Butcher (1987), Sect. 143) 
4& ou 一 [WUmyz]， uUou 一 |. (7.3) 
(compare also Definition HI.3.7 and Fig.7.1 below). Because of (7.2), this implies 
aluoo)+aluo= 一 au) au) for 小 VE 了 (7.4) 


We now forsget that the B-series (7.1) has been obtained from a Runge-Kutta 
method, and we ask the following question: is the condition (7.4) sufficient for a 
B-series method deftined by (7.1) to conserve exactly quadratic first integrals (and 
to be Symplectic)? The next theorem Shows that this is indeed true, and we Shall see 
later that condition (7.4) is also necesSary (cf. Chartier Faou & Murua 2005). 
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Theorem 7.1. Comsider 4 B-seriesy 7aetpod @1(Vy) 三 有 B(ay) and Proplenas 
1 = pavingQ@( 三 CU (wiisymaaetric maatrix C) as 太 st tfegral 

太 1pe coeHjcients a(7T) Satis 访 (7 人 四 加 en tpe aetpod exacthy co1aservey QU) aad 
it 18 Sy12DLectic. 


Proof al) Under the assumptions of the theorem we Shall prove in part (c) that 


jul+ol 


人 


BlwyW7cBlwJ=wCcy+>， 
VE 人 


with rz(uv) = al al) 一 auoo) 一 ao Condition (7.4) is equivalent to 
mu(wu) 一 0 and thus implies the exact conservation of QUV) = y7CV. 

To prove Symplecticity of the method it is sufficient to Show that the diagram of 
Lemma 4.1 commnutes for general B-series methods. This is seen by differentiating 
the elementary differentials and by comparing them with those for the augmented 
System (Exercise 8). Symplecticity of the method thus follows as in Sect. VI.4.1 
form the fact that the Symplecticity relation ls a quadratic first integral of the aug- 
mented System . 

b) Since Q(y) = Co is a first integral of y = ,wehave Cj) =0 
for all y. Differentiating 7 times this relation with respect to yy yields 


977 


CD 人 (本寺 CA (有 Rn) 一 0. 


J 王 1 


Putting 所 三 全 (7 六)(y) we obtain the formula 


77 


多 CF(m rm]) 切 = 一 >》 下 (万 (TCF TD TH TO)， 


J=1 
Which can also be written in the form 


rc Fr)() 亲 OO) EC 07.9 








0 人 7) 


UVUET UVoVU 一 古 


c) With (7.1) the expression 中 Cwl becomes 


Bawy)7CB(wg = 内 Cy+2%7C >》， 人 (r) Fr)(y) 
和 企 针 
| 区 | 十 |v| 
人 
IO 


Since C is Symmetric, formula (7.6) remains true 让 we Sum over trees ,UV Such that 
允 ou 三 T.JInserting both formulas into the Sum over T leads directly to (7.3). 
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了 上 xtension to P-Series. All the previous results can be extended to partitioned meth- 
ods. To find the correct conditions on the coeffcients of the P-series, we use the fact 
that the numerical solution ofapartitioned Runge-Kutta method (II.2.2) is aP-series 


全 二 。 (Po， 了 的 人 如 a(z) 刁 (uw)(po, ao) ] 
妥 Ta(o, (Po,go)) 浊 2 计 TP， 和 au) 天 (uv)(po, do) 





(77) 
with coefficients w(7) given by 
”bidi(T for 7T7E7P 
a(7) 社 2 Y 人 ) 卫 (7.8) 
> 50i(7T) for 7E7T7P 


(see Theorem II.2.4). We assume here that the elementary differentials 严 (7)(D, 9) 
originate from a partitioned sytem 


六 三 万 (P,9)， d= 户 (,9)， (7.9) 
Such as the Hamiltonian system (1.7). This time we multiply (4.3) by bi(u) 0) 
(where 允 三 [umlp ETPandu 三 [ouUllye 7TP) and we sum over 


allz and7. USsing the recursion (JIU.2.7) this yields 


opoa+ 和 Doiooa =- ( 症 oa) ( po) (07.10) 


whereou 三 [umyulpandvov = [ua ubU Because of (7.8), this 
implies the relation 


auoU) 二 ao 三 al au) for ETP， VETP. (7.11) 
Since di(7) is independent of the colour of the root of r, condition (4.4) implies 


a(T) is independent of the colour of the root of 7, (7.12) 


Theorem 7.2. Comsider ad P-series 1aetjpod (D1;,q1) = 于 (Do,qo) give py (7. 作 
a1da poplem (79) javing Q(D,9g) 一 DT 已 g as Jst jntegral 

动 矿 1Pe coefjicients a(T) Satis 有 (7.17) ad 17.72) 1Pe 1aetpod exactiy co11geTves 
Q(D;,d) and iiSy1Pplectic Jorgeneral Fa1ailtomia1a syste1ag (7.7) 

让 矿 1e coefjicients a(T) satish oa1y (7.77 te 7aetpod exacty co1agerves 
Q@(D,d) Jor proples of tpe jorrD = 万 ( 四 9 = 户 (D)，and 1 sy1iDpiectic Jor 
SeDa1aD1e 瑟 Ga11alizlto1lia1z syste11g Wpe1e 五 (D,9q) = 了 (D) 十 U(q). 


Proof This is very similar to that of Theorem 7.1.  Q@(p,dg) = 太 is a first 
integral of (7.9), we have 方 (p,g7 瑟 go 二 2 五 户 (p,g) 三 0 forallpand dg. Differ- 
entiating 7m0 tmes with respect to p and mu times with respect to dg yields 
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770 门 匈 工 
0 一 忆 p DY 户 (P 9 有 rn] 五 0 





二 玉 瑟 DYDY 户 (OP9) yj 人 yb (7.13) 
7 人 开 

S >》 7 忆 v 万 (D gp] 古 入 
J=1 

二 >》 民 百 DVDI 户 (0 
这 1 


Putting js 三 下 ()(P;,9) with Wi E 7TP 妨 三 下 (oj)(D9) with wj E TP mm 三 
[auUmy ut Unp and7o 三 [Un ul un weobtain as in part 
(b) of the proof of Theorem 7.1 that 


Co) gg) 
al7p) 0(79) 
二 汪 。 2 的 作 全- 2 四 CO 证 9 


9 二 7TPp Vou 一 Ta 








Where the sums are over VE 7 记 andveE 7 
With (7.7) the expression zf 已 0 becomes 


已 (oa (9 巨 Po)) = 了 克 Pg (7.15) 
| |2| 
+ 于 和 POPOrPe+ PE oPOg 
wuETP。 ucTP。 
ul+ol 
aluja(o) Gu)(P, 9 五 下 (wo)(P,9). 





ucTBerpl coto) 


Condition (7.12) implies that ao(mp) = w(7o) forthe trees in (7.14). Since also |7| 三 
| 冯 | and c( 疡 ) = al(m), two corresponding terms in the sums of the second line 
in (7.15) can be jointly replaced by the use of (7.14). As in part (c) of the proof of 
Theorem 7.1 this together with (7.11) then yields 


忆 (a (2g) 玫 瑟 忆 (ao (9)) 三 克 瑟 w， 


which proves the conservation of quadratic first integrals 27 已 q. Symplecticity fol- 
lowsgs as before, because the diagram of Lemma 4.1 also commnutes for general P- 
Series methods. 

For the proof of statement (iD we notice that 万 (gg)7 五 十 2 五 户 (p) =0 im- 
plies that 方 ()7Pg=0 and 2 五 户 (p) =0 vanish separately. Instead of (7.14) 
we thus have two identities: the term 下 ( 疡 )(D, 9g)7 瑟 oj/u( 记 ) becomes equalto the 
first sum in (7.14), and 了 太 互 PP(m)(,9q)/e(m) to the second sum. Consequently， 
the previous argumentation can be applied without the condition a(7) = a(75). 
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Second Order Differential Equations。 We next consider partitioned Systems of 
the particular form 


六 = 广 (9 4 一 CD 二 2， 47.10) 


where C is a matrix and c a vector. Since problems of this type are Second or- 
der differential equations 4 = CC 万 (d), partitioned Runge-Kutta methods become 
equivalent to Nystrom methods (See Sects.IL.2.3 and IV.2.3). 

An important Special case are Hamiltonian Systems 


盖 一 VU(d)， dd 一 CDp 二 c (7.17) 


(or equivalently, 4 = -CCVU(q)). They correspond to Hamiltonian functions 
| 
瑟 Og) = CD 二 cpD+U(g)， (7.18) 


where the kinetic energy is at most quadratic in Dp (here, C is usually Symmetric). 
Jn aP-series Tepresentation ofthe numerical solution, many elementary diffteren- 
tials vanish identically. Only those trees have to be considered, whose nelghbouring 
vertices have different colour (the problem is separable) and whose white vertices 
have at most one Son6 (second component is linear). We denote this set of trees by 


neighbouring vertices of 7 have different colour 


TV 二 { TP ee 
了 人 White vertices of 7 have at most one Son 


上 (7.19) 
and we let Yu be the corresponding Subset of 7 

The Same procedure as for partitioned methods permits us to write the Symplec- 
ticity condition of Theorem 4.8 in terms of the coefficients wa(7) of the P-series. 
Assuming a(。) = wa(o) = 1,the two conditions of (4.5) lead to 


alooT+Taluwoo)=alu)al(o) forwe 7TN (7.20) 

auja(oeov) 一 aluwoov)=aloeouwaluv) 一 aoouw forwveTN (07.21) 
where we use the abbreviating notation 

uUoou 一 Vol(oeou)= 三 [ua ,umnyluyv (7.22) 


这 尺 三 [ua .WUmjp Notice thatforw VE 7TNpthetreeswoo,uWoouwanduoou 
arein 7TNpandoowiin 7N。. 


Theorem 7.3，Cozsider a P-series 11etjpod (7.]) Jor dierential equatio1ay (7.76) 
having Q(D;, dg) 一 2 已 g as 万 st in1egral 

矿 1pe coejjicients a(T) Satish (7.20) ad (7.2 册 加 e 1metpod exactiy co1lyerves 
Q@(D;,d) aadiisyiplecticJor Panmailtomian syste11y Wi 加 瑟 (D, 9) oftjpe Jora(7.78). 


5 Attention: with respect to (IIL2.10) the vertices have opposite colour, because the linear 
dependence is in the second component in (7.17) whereas it is in the first component in 
(LIU.2.9). 
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Pioof Since the elementary differentials 忆 (7)(p,9) vanish identically for 7 人 
7TN U 7TN we can arbitrarily define a(7) for trees outside TNn U 7TN。 with- 
out changing the method (throughout this proof we implicitly assume that for the 
considered trees neighbouring vertices have different colour). We Shall do this in 
Such a way that (7.11) holds. 

Consider first the tree ooV. There is exactly one vertex between the roots of 以 
and V. Making this vertex to the root gives the tree [uvj which is not in TV We 
define forvE 7Nr 


alwala) 一 agoaleow 一 aluooo)， 


Condition (7.21) shows that a([w,vjv) is independent of permuting v and vw and is 
thus well-defined. For trees that are neither in 7TNn U 7TNu nor of the form [uv]。 
with 由 VE 7TNn we let a(7T) = 0. This extension of a(7) implies that condition 
(7.11) holds for all trees, and part (ii) of Theorem 7.2 yields the statement. Notice 
that for problems = 万 (四 , d = 户 (D) only trees, for which neighbouring vertices 
have different colour are relevant. 














VI.7.2 Characterization of Symplectic P-Series (and B-Series) 


A characterization of symplectic B-series was first obtained by Calvo 多 Sanz-Serna 
(1994). We also consider P-series with various important Special Situations， 


Theorem 7.4，Co7zsider ad P-seriesy 11etjpod (7.7) applied io ad ge1ie1al Da1LIIO1ed 
di1eretial equation (7.9). EggUiVaQLent a1e: 
7) 1pe coefjcients a(T) Sa (7.17) and (7.72) 
2) quadcyafic jjstf 加 tegralsy oftpe jpora Q(D,g) = 了 瑟 gare exactiy comserved 
3) 1pe 111etpod 18 8Sy1IDlectic Jor ge1le1al Ba111111o1ia1 8SyS1e118 (7.7]). 


Proof The implication (1) 全 (2) follows from part (D of Theorem 7.2, (2) 僵 (3) ls a 
consequence of the fact that the symplecticity condition is a quadratic first integral of 
the variational equation (See the proof of Theorem 7.2). The remaining Implication 
(3) 僵 (]) will be proved in the following two steps. 

al) We fix twotrees E 7 记 andvE 7PFoand we construct a (polynomial) 
Hamiltonian such that the transformation (7.7) satisfies 





他 人 > 主 clatu ouTaluo 一 al ao)) (7.23) 
DDI Dai 

with C 和 0 (here， 三 denotes the first component of po, and 0 the Second compo- 
nent of go). The symplecticity of (7.7) implies that the expression in (7.23) Vanishes， 
So that condition (7.11) has to be satisfied. 

For given  E 7 andvcE 7 we define the Hamiltonian as follows: to the 
branches of v ov we attach the numbers 3,... ,| 十 lv| 十 1such that the branch 
between the roots of v and v is labelled by 3. Then, the Hamiltonian is a Sum of 
as many terms as vertices in the tree. The Summand corresponding to a vertex 1$ a 
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ovyv 克 


2 DO1 VOL 





Fig. 7.1. IJlustration of the Hamiltonian (7.24) 


product containing the factor Dj (resp. 97) 直 an upward leaving branch“ 思 is directly 
connected with a black (resp. white) vertex, and the factor % (resp. D2) 让 the vertex 
itself js black (resp. white) and the downward leaving branch has label “7 Finally， 
the factors q2 and pl are included in the terms corresponding to the roots of v and 
Vu, TeSpectively. For the example of Fig.7.1 we have 


刀 (p,g) = g203d40 十 Dlp3p778 十 Do 十 呈 十 g5 十 0 十 9 (7.24) 


The components F(7)(p,9q) of the elementary differentials corresponding to 
the Hamiltonian System (with the Hamiltonian constructed above) satisfy 


F2(uov)(9g) = (-D5ooeo)oluoo) Ph 

Fillooa)(D9g) = (-D5oooo(looz) 02， 
Pa,g = (Daocl.e 
Fa(o)(p,q) = (-Dito)c(o) pu 


and for all other trees7T E 7 and components ? we have 


DOF:(7) 
DD1L 


DF(7) 
D02 





(0,0) 三 (0,0) = 0. 

In (7.23), 6(7) counts the number ofblack vertices ofr, and the syrazjatetry coejiciert 
alT) is that of (IIL.2.3). For example, a(uw) = 1 and olv) = 2 for the trees of 
Fig.7.1. The verification of (7.25) is straightforward. The coefficient (一 1)54m is due 
to the minus Sign in the first part of the Hamiltonian System (1.7), and the symmetry 
coeffcient c(7) appears in exactly the same way as in the multidimensional Taylor 
formula. Due to the zero initial values, no elementary differential other than those 
of (7.25) give Tise to non-vanishing expressions in (7.23). Consider for example 
the Second component of F(7)(p,9) for atree7T E TP. Since we are concerned 
with the Hamiltonian System (1.7), this expression starts with a derivative of 已 o>. 
Therefore, it contributes to (7.23) at po = 9qo = 0 only iit contains the factor 
Losoap5 (for the example of Fig.7.1). This in turn implies the presence of factors 
厂 pa ,万 .and 万 js .Continuing this line ofreasoning, we find that F2(7)(p,9) 
contriputes to (7.23) at po 一 qo 三 0onlyi7 三 人 ov.With similar arguments we 
see that only the elementary differentials of (7.25) have to be considered. We now 
insert (7.2$) into (7.7), and we compute its derivatives with respect to pl and 02. 
This then yields (7.23) with C = (一 1)500+5(o)jul+lol， and completes the proof 
concerning condition (7.11). 
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b) The necessity of condition (7.12) is seen Similarly. We fx atree7 E 7 户 
and welet 寺 ETPo bethetree obtained from 7 by changing the colour of the root. 
We then attach the numbers 3,...,|r| 十 1to the branches of r, and we define a 
Hamiltonian as above but, different from adding the factors g2 and pl1,， we include 
the factor pld?2 to the term corresponding to the root. For the tree7 一 of Fig.7.1 
this yields 

瓦 (D,g) = 太 生 全 六 十 太太 十 有 十 生 . 
With this Hamiltonian we get 


F2(r)(D9) 三 (1250o(7) DL 
FF)(D9) = (-15o(7) .92， 


and these are the only elementary differentials contributing to the left-hand expres- 
Sion of (7.23). We thus get 


ol,g1)NT 7OU1,91) 6(r)7 7| 
人 7 二 = 的) 
which completes the proof of Theorem 7.4. 














Theorem 7.S. Co1lsider da P-se1iesy 11etpod (7.Z) applied 1o Ga sepa1apie Pa1titiomed 
Cerential equation 六 = 万 ( 四 = 户 (D). Eouivalent are: 
7) 1je coefpcients a(7T) Sa (7.7 几 
2) quadrafic jzsf integyals oftpejora Q(D,dg) 三 了 gare exacthy comserved 
3) 1pe1metpodisy1iplecticjorseparaple Fammailtonians (Dp,9) = 了 (D) 十 Cd). 


Proof The implications (1) 全 (2) 人 (3) follow as before from part (ii) of Theo- 
rem 7.2. The remaining Implication (3) 僵 (]J) is a consedquence of the fact that the 
Hamiltonian constructed in part (a) of the proof of Theorem 7.4 is separable, when 
and w have no neighbouring vertices of the Same colour. 














Theorem 7.6. Comsider a B-series 1metpod(7.1)Jjory = 帮 八 . Eaguivalent aye: 
7) 1jpe coefjcients a(7T) sai 及 (7 和 
2) quadrafic jzsf integrals oftpe jp QU = CU are exactiy comserved 
3) 1pe maetpod isymaplectic jporgeneral Famailionian syste11g1 = JIV 万 (I. 


Proof The implications (]) 僵 (2) 全 (3) follow from Theorem 7.1. The remaining 
implication (3) 僵 (1) follows from Theorem 7.4, because aB-series with coefficients 
alT),TET,appliedto apartitioned differential equation, can always be interpreted 
as aP-series (Definition HI.2.1), where a(7) := a(P(7T)) forrE 7TPandp:7P 一 
7 ls the mapping that forsgets the colouring of the vertices. This follows from the 


fact that 让 多 坝 
也 了 二 FT QUL 
5 | 
VETPo,P(u) 王 了 au) 天 (v )(P， 9g) 
for7r ET,because au cu) = av: cu) =e(7T) :|rll. Here,y =(D9qj,the 
elementary differentials 忆 (7)(y) are those of Definition II.1.2, whereas 严 (z)(P, 9) 
and 天 (uv)(D,9) are those of Table II.2.1. 
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Theorem 7.7. Co1lsider a P-series 11etpod (7Z.Z) applied io 1je Special Pa1titiomed 
SyS1e1a (7Z.70). EqgUivaLent Q1e: 
7) 1pe coefjcients a(T) Sa 及 (7.20) ad (7.2 由 
2) quadrafic jzrsf integ1als oftpejorpa Q(D,dg) 三 了 瑟 gare exacthy comserved 
3) 1pe 111etpod 18 8Sy1Dlectic Jor Famatlio1ia1a syste118 of ipe Jora (7.77]). 


Proof The implications (]) 僵 (2) 僵 (3) follow from Theorem 7.3. The remaining 
implication (3) 僵 (1) can be Seen as follows. 

Condition (7.20) ls a consequence of the the proof of Theorem 7.4, because for 
wwE7TNpanduv=。theHamiltonian constructed there is of the form (7.18). 

To prove condition (7.21) we have to modify slightly the deftinition of 妃 (P, q). 
We take VE 7TNn and define the polynomial Hamiltonian as follows: to the 
branches of woovw we attach the numbers 3,.…. ,| 才 十 lv 十 2.The Hamiltonian is 
then a sum of as many terms as vertices in the tree. The summands are defined as in 
the proof of Theorem 7.4 with the only exception that to the terms corresponding to 
the roots of w and vw we include the factors g2 and 9gl, respectively, instead of gq2 and 
DLL. This gives a Hamiltonian of the form (7.18), for which the expression 


O(D1, 01) 二 D(D1;,91) 
(的 ro 
becomes equal to 
alujaloeoo) 一 aluwoou) 一 aleouwjaw) 十 auooz) (之 力 


up to a nonzero constant. By Symplecticity, (7.20) is zero So that also (7.27) has to 
vanish. This proves the validity of condition (7.21). 














VI.7.3 Irreducible Runge- 下 utta Methods 


We are now able to Study to what extent the conditions of Theorem 4.3 and Theo- 
rem 4.0 are also neceSSary for Symplecticity. Consider first the 2-stage method 


1/2| Qw 1/2 一 a 
1/2| 6 1/2-8 
| 12 1/2 





The solution of the corresponding Runge-Kutta system (II.1.4) is given by Ki 一 
Ka2 一 [, where 有 = /yo 二 KR/2),andhenceyi = 十 AKE. Whatever the values of a 
and DO are, the numerical solution of the Runge-Kutta method is identical to that of 
the implicit midpoint rule, so that it defines a Symplectic transformation. However， 
the condition (4.2) is only satisfied for a = = 1 /4. 


Definition 7.8. Two stages7 and7 of a Runge-Kutta method (1I.1.4) are Said to be 
edUivalentjor aclass (P) of initial value problems, if for every problem in ( 刀 ) and 
for every sufficiently small step size we have 应 = 1 (2 = 1 andti 三 人 for 
partitioned Runge-Kutta methods (II.2.2)). 
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The method is called rreduciple jor (P) 让 it does not have equivalent Stages. 
It is called ?rreducipie 寺 it is irreducible for all suffticiently Smooth initial value 
problems. 


For a more amenable characterization of irreducible Runge-Kutta methods, we 
introduce an ordering on 7 (and on 1), and we consider the following s x co 
matrices 


DRk = (%(7)i7ET) with entries bi(7) = gi(7) given by (IIL1.13)， 

ZpRggk = (bg(T)iireTP)= (or)ireTP) with entries bi(7) givenby (II.2.7); 
observe that bi(7) does not depend on the colour of the root， 

GiRk = (Or)i7E 7TP”) = (W(r)i7re 7TP7) Where 7 (resp.TP7) is the set 
of trees in 7 (resp. 7P) whose neighbouring vertices have different colours. 


Lemma 7.9 (Hairer 1994). 4 Razpge-Kztta 1Ietpoad is rreducipie 矿 amnd oly 矿 1pe 
1IG11IX GORK jay Je1l 7a7K 5. 

4 Pa1titio1led Ri118e-KU1tG 11etpod 18 rreducipie 矿 a1zd oly 矿 1Pe 1IQ1IX OPRK 
Has Joell 7a1K 5. 

4 Pa1LLO1ed Ru118e-KU1ta 11etpod 18 1rreduciple jor sepa1apie p1ople11y 7D 三 
万 (q), = 户 (P) 矿 and only 太 1pe 71aatrix GPRg Pas jl7anK s. 


Prooj Ithe stages7andy are equivalent, it follows from the expansion 


|7| 
已 = 和 二 br) PC)() 


TE 了 人 7) 


(See the proof of Theorem II.1.4) and from the independency of the elementary 
differentials (Exercise II.3) that di(7) = gji(7) for all 7 E 了 . Hence, the rows 
1 and7 of the matrix OORK are identical. The analogous statement for partitioned 
Runge-Kutta methods follows from Theorem IIU.2.4 and Exercise II.6. This proves 
the suffhiciency of the“full rank” condition. 

We prove its necessity only for partitioned Runge-Kutta methods applied to Sep- 
arable problems (the other situations can be treated Similarly). For separable prob- 
lems, only trees in 7 U To give rise to non-vanishing elementary diffterentials. 
Ireducibility therefore implies that for every pair ( 冯 7) with 夫 7 there exists atree 
TE 7TP* such that bi(7T) 夫 %(7). Consequently, a certain finite linear combina- 
tion of the columns of GPRk has distinct elements, ij.e., there exist vectors 5 E 豚 光 
(only finitely many non zero elements) and 7 E 下 * with CiRgEe 一 mand71i 夫人 7 
fori 夭 7 思 .Due to the fact that bi 人 (| Tom]) = 大 (六 赂 (Im]),the com- 
ponentwise product of two columns of wpRk is again a column of OpRk. Continuing 
this argumentation and observing that (1,...,1J)7 is a column of GzRk，we obtain 

了 一 1 


amatrix X Such that gjRkX = (7 )27-1 is a Vandermonde matrix. Since the 1 


are distinct, the matrix OpRKk has to be of full rank s. 














7 Inthis section weletd(r) E 下 * denote the vector whoseelements are bi(7),1 一 1. .5. 
This should not be mixed up with the value %(7) of (II.1.16). 
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VI.7.4 Characterization of Irreducible Symplectic Methods 


The necessity of the condition (4.2) for Symplectic Runge-Kutta methods was first 
stated by Lasagni (1988). Abia 儿 Sanz-Serna (1993) extended his proof to parti- 
tioned methods. We follow here the ideas of Hairer (1994). 


Theorem 7.10. 47 777reduciple Ru1age-KUtta 1Ietjpod (171.4) 1 Sy1iDlectic 矿 a11d 
O11y 矿 1Pe cozditiom (人 2) 1olds. 

471 .777educipie Partitio1ed Ru118ge-KU1ta 1Iethod (11.2.2) 18 Sy1Dlectic 夏 a1ad opy 
矿 1pe co11ditio18 (4.3) G110 (4.4) 1old. 

4 Da11i1ioed Ru118e-KU1ta 11ethod i7rreduciple Jorseparaple Pp1rople111s，18 8y1I- 
Piectic Jorseparaple 瑟 aailtonians (D,9) =T(D) 十 U(9) rand om1y 矿 ipe comdi- 
1o1 (4.3) 1olds. 


Prooj The “这 ”part of all three statements has been proved in Theorem 4.3 and 
Theorem 4.6. We prove the“only 让 ”part for partitioned Runge-Kutta methods 
applied to general Hamiltonian Systems (the other two statements can be obtained 
in the Same way). 

We consider the s x s matrix M with entries 7Di7 三 DG 十 Daji 一 已， The 
computation leading to formula (7.11) shows thatforw E 7 天 andvE 75 


goO7Md(o) = aluoo+alooog -ad :at 


holds. Due to the Symplecticity of the method, this expression vanishes and we 
obtain 
OPRKM GpRk = 0， 
Where JpRK ls the matrix of Lemma 7.9. An application of this lemma then yields 
Ad = 0, which proves the necessity of (4.3). 
For the vector qd with components wi 三 访 一 已 We get ad7GpRgk 一 0, and we 
deduce from Lemma 7.9 that qd = 0, so that (4.4) is also seen to be necesSary. 














VI.8 Conjugate Symplecticity 


The symplecticity requirement may be too Strong 让 we are interested in a correct 
long-time behaviour of a numerical integrator. 9toffer (1988) suggests considering 
methods that are not necesSsarily Symplectic but conjugate to a Symplectic method. 


Definition 8.1. Two numerical methods 2 and ware mutually cozzjxgate, ithere 
exists a global change of coordinates Xn, Such that 


于 一 Xo 萝 oXn (8.1) 
We assume that xn(y) = 十 O(P) uniformly for y varying in a compact Set. 


For a numerical solution yn+l = gp(yn), lying in a compact subset of the 
phase space, the transformed values 2 = Xn(Wn) constitute a numerical solution 
2+1 三 殉 ( 加 ) of the second method. Since 轨 一 加 = O(P), both numerical 
solutions have the Same long-time behaviour, Independently of whether one method 
Shares certain properties (e.g., Symplecticity) with the otherT. 
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VI.8.1 Examples and Order Conditions 


The most prominent pair of conjugate methods are the trapezoidal and midpoint 
rules. Their conjugacy has been originally exploited by Dahlquist (1973) in an in- 
vestigation on nonlinear Stability. 

开 we denote by 瑟 几 and G7 the explicit and implicit Euler methods, respectively， 
then the trapezoidal rule 下 / and the implicit midpoint rule GE can be written as 


二 jj 相 51/2 GE 一 Na S Gj]2 


(See Fig.8.1). This Shows 瑟 / 三 XXX with Xn 一 字 N2， implying that the 
trapezoidal and midpoint rules are mutually conjugate. The change of coordinates， 
Which transforms the numerical solution of one method to that of the other is O( 户 )- 


close to the identity. 




































































四 一 | tap 二 人 @ 一 | ap 二 > 人 人 

和 和 

O(]) < 租 : < AN 
e 一 | Vmidp. 二 > 全 一 | midp 二 > 人 @ 一 | midp. >。 
@ 一 mol >@ 一 Im > @ 




















Fig. 8.1. Conjugacy of the trapezoidal rule and the implicit midpoint rule 


了 fact we can do even better 开 we let By/2 be the Square Toot of 画 信 (1.e.， 
区 jp/2 o 21/2 三 1 ， See Lemma V.3.2), then we have (Fig.8.1) 


7 一 (Ba) O GE 各 Na 一 (Ba) O 写 77/2 O 宛 772 O 〇 772 O 〇 G77 答 GE12 


So that the trapezoidal and the midpoint rules are conjugate via Xn 三 7 O G2 12， 
Since @hn/2 and 也 /2 are both consistent with the Same differential equation, the 


transformation Xhn is O(h2)-close to the identity. This shows that for every numeri- 
cal solution of the trapezoidal rule there exists a numerical solution of the midpoint 
rule which remains O(72)-close as long as it stays in a compact set. A single trajec- 
tory of the non-Ssymplectic trapezolidal rule therefore behaves very much the Same 
as atrajectory of the symplectic implicit midpoint rule. 


A Study via B-Series. An investigation of Runge-Kutta methods，conjugate to 
a Symplectic method, leads us to the following weaker requlirement: we Say that a 
numerical method @j is co1j1zgate 1o QSy1Plectic 1ietjhod 用 岂 1oordery,ithere 
exists a transformation Xn(y) 王 V 十 @O(P) such that 


环 ( 门 = (Xeo 琴 oXhn( 人 二 OATD)， (8.2) 


This implies that the error of such a method behaves as the Superposition of the error 
of a Symplectic method of order 2 with that of a non-Ssymplectic method of order 7. 
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In the following we assume that all methods considered as well as the conjugacy 
mapping Xj can be Tepresented as B-Sseries 


2( 幼 二 Bo 人 四 幼 =B 人 人， Mn 一 Be (83) 
Using the composition formula (II.1.38) of B-series, condition (8.2) becomes 
(ac)j(7) = (cb)(7) for |Irl 系 7 (8.4) 
The following results are taken from the thesis of P Leone (2000). 


Theorem 8.2. ZefG1(V) = 已 (ay) represent da 11aerical 1metpod ororder 2. 
Cj TIS GAayS Co11U8aie 10 G Sy11DUectic 1Ietpod zpD 1o order 3. 
DJ DNS co111U8ate 10 QSy11Dlectic 11etjod op io order 4 矿 azad ozpy 矿 


avV) -2a(。 力 =0 太志 (8.3) 
玉 eje, we USe 1pe apbreviation auU) = au alw) 一 aoo) 一 auo). 


Pioof The condition (8.4) allows us to express b(7) as a function of a(w) for |v| 芯 
7r| and of c(w) for |v| < |Ir| 一 1 use the formulas of Example II.1.11). All we have 
to do is to check the symplecticity conditions gu) = 0 for | 直 十 lv| 和 (see 
Theorem 7.0). 

Since the method Gn is of order 2, we obtain bf(。) = 1 and 0( 7) = 1/2. We 
arbitrarily fx c(。) = 0, so that the symplecticity condition bg(。, / ) = 0 becomes 
2c( 1/ ) = a(。, 7 ). Deftining c( yy ) by this relation proves statement (3a). 

For order 4， the three Symplecticity conditions b(。,V) = 20,[[]]) = 
b( 7 ,7 ) = 0haveto be fulfilled. One of them can be satisfied by deftining suit- 
ably c(\W) + c([[]]); the other two conditions are then equivalent to (8.5). 














Theorem 8.3. ZefGjp(V) = B(a,y) 7epresent da 11U1te1ical 1aiethod ofrorder 4 并 他 
CO1J1U8ate 10 G Sy11Dlectic 1ietpod zpD 1o order 5, 矿 a1ad opy 矿 


IE 和 
二 


Prooj The idea of the proof is the Same as in the preceding theorem. The verifica- 
tion ls left as an exercise for the reader. 














了 xample 8.4. A direct computation Shows that for the Lobatto IIB method with 
5 = 3wehaveoa(/V) = 1/144, and al(wv) = 0 for all other pairs with 


| 直 十 |v| = 5. Theorem 8.3 therefore proves that this method is not conjugate to 
asSymplectic method up to order 5. 
For the Lobatto IIA method with s = 3 we obtain a(/ ,WV) = -1/144， 


ay ,|[[]]) = -1/288, and aluwv) = 0forthe remaining pairs with |v| 十 |v| = 5. 
This time the conditions of Theorem 8.3 are fulfilled，so that the Lobatto IIA 
method with s 一 3is conjugate to asSymplectic method up to order 5 at least. 
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VI.8.2 Near Conservation of Quadratic First Integrals 


We have already met in Sect. VI.4.1 a close relationship between Symplecticity and 
the conservation of quadratic first integrals. The aim of this section ls to Show a 
Similar connection between conjugate Symplecticity and the near conservation of 
quadratic first integrals. This has first been observed and proved by Chartier, Faou 
多 Murua (2005) using the algebra of rooted trees，. 

Let QU/) = 7 Cy (with symmetric matrix C) be a quadratic first integral of 
=,and assume that 2B1(V) is conjugate to a method 奈 (y) that exactly con- 
SerVves quadratic first integrals (e.g., Symplectic Runge-Kutta methods). This means 
that yn+1 一 21(Vn) satisfies 


Xp OCX = Xp Cxn(onh， 


and the expression Q(J) 一 Xhn(V)LCxn(y) is exactly conserved by the numerical 
Solution of Gj(V). 开 Xn(y) 三 已 (cy) isaB-series, this is of the form 


qOJJ= >》 jh 的 Er)7CF(OO()， (8.6) 


T;9ETU{O1 


where 严 ( 轨 (y) = yand Il 外 = 0 for the empty tree, and BO,0) = 1. We have 
the following criterion for conjugate Symplecticity，where all formulas have to be 
interpreted in the Sense of formal series. 


Theorem 8.5. 4sstu1zae 1Pat a ojze-step 1ietjpod Gpn(y) 三 已 (ay) 1eaves(8.0) va7- 
antjorall proplems7 = 0) paving QU = VCU as Jizrsf integ1roL 

7T7je1 过 六 co1jzgate 1oQGSy1iD1ectic inateg1ator 且 (2 直 Le tpere exisf atra11sjo 广 
1201ion 2z 一 Xi() = 已 (cg) sucp1pat 萝 (z) = XioGoxX(z) oreqguivalentty 
玩 (z) = 再 (clac,z) isy1iplectic. 


Pioof The idea is to search for a B-series 已 (c,y) such that the expression (8.0) 
becomes 中 
QUy) 二 (c， 四 7C (c， 2. 
The mapping z = Xn(y) = 已 (c,y) then provides a change of variables such that 
the original first integral Q(z) = 2Cz is invariant in the new variables. By Theo- 
rem 7.0 this then implies that Vis Symplectic， 
By Lemma 8.6 below, the expression (8.0) can be written as 


GO =yrcl(y+y An)FOO)O)， (7) 
OET 


Where 11(0) = 0 for |0| < , 让 the perturbation in (8.6) is of size O(17). Using the 
Same lemma once more, we obtain 


Blegrcalea = wyrclo+3 开 人 coOFOU] 
和 


1 本 
HorC( 工 (2 开光 0 crjct)P(O)G) 


ge 了 TDE 了 
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A comparison of the coefficients in (8.7) and (8.8) uniquely deftines c(O) in a recur- 
Sive manner. We have c(0) = 0 for |0| < 7, so that the transformation z = 妃 (c,y) 
is O(N) close to the identity. 














The previous proof is based on the following result. 


Lemma 8.6. Let QU) = CU (wii symaaetric matrixr C) pe a Jizsf integral or 
1 三 人 .T1jem Jjoreverypair oftreeyT ET wehave 


Fr)WrCFO)W=FC(y are(O)P(O)() 
OET 
T71ig SU1a is Jaite da1zd only over treesy satishing |0| = |r| 十 12. 


Proof By definition of a first integral we have VC jy) = 0forall y. Differentia- 
tion with respect to V gives 


FTCETWCNPONE=0 forall 大 . (8.9) 


Putting 天 三 下 (O)(z,this proves the statement for7 一 。. 
Differentiating once more yields 


(FOODOOCETCCNOOETC 产 OO=0， 


Putting 4 = jy) and using (8.9)，we get the statement for 7 = /. With L == 
五 ()(y) we obtain the statement for 7 = | 九 ] provided that it is already proved for 
九 . We need a further differentiation to get a similar statement for 7 = |,T2], etc. 
The proof concludes by induction on the order of 7. 














了 Partitioned Methods. This criterion for conjugate symplecticity can be extended 
to partitioned P-series methods. For partitioned problems 


We consider first integrals of the form 了 (p,9) = 了 已 0, where 已 is an arbitrary 
constant matrix. 二 p(p,dq) is conjugate to amethod that exactly conserves 到 (D, 9)， 
then it will conserve a modified first integral of the form 


Z(p,q) = > hiri+lal6(r 信 F(r)(p, 907 已 FOOD， (8.11) 
TETPU{0p}9ETPU{0o 
Where G( 务 ,加 ) 三 | 天 ( 加 )(p, 9) =D, 下 (0o)(D;,9) = 9. We firstextend Lemma 8.6 


to the new Situation. 


Lemma 8.7. Let 了 (pg) 三 2 已 gd peajizrstipntegral off(8.70). 7Tjpem jporevery pair 
orpeesTETPOETPFP we have 


Fo,OTPFO),9 = TB( > ro(O)F(OO)O 9g) 


beTP。 (8.12) 


过 人 次 ersa(OF(O)g) Ed 


gcTP， 


7Tjpese Su118 Gd1e jiaite ad oly over teesy Saishing | 中 | = |7| 十 |2|. 
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Pioof Since 卫 (D,9) 三 Do is a first integral of the differential equation，we 
have 万 (pg)7 五 go+D7 瑟 户 (pg) =0forallpandd.AsintheproofofLemma 8.6 
the statement follows from differentiation of this relation. 














Theorem 8.8. 4sszrae 1Pat a Partitioned one-step 11etpod 1(D,q) = 忆 (a,(D,9)) 
1eaves (8.17) jpvariant jpor all proplenas (8.10) paving LED) = OPd as Jst 
1721eg1QL. 

7T7jez iticojagate io asSy1iplectic iteg1ator 办 (WU Le tperematra1nsjor 广 
71120107 (WU) 一 Xi(D9) = 忆 (ci (DO) sucptpar 萝 (o) 一 XioEoxXri(v)， 
or eguiyalentiy 琴 (u) = Peiac (wu)) 访 sy1iDplectic. 
Pioof We search for a P-series P(c,(p,dq)) = (已 (ec， (D, 9 四 ) ,Pi(c,(P， 四 )) Such 
that the expression (8.11) can be written as 


Z(D9) = 已 (c (pg) 瑟 已 (c (pg))， 


As in the proof of Theorem 8.5 the mapping (wu) = Xpn(D,9q) = P(c,(D,9)) then 
provides the Searched change of variables. 
Using Lemma 8.7 the expression (8.11) becomes 


Fog0=mp(o+ MOFOO+( Pen)FUO OO) Ed 


geTP。 gcTP。 


Also 已 (c,(D,g)) 瑟 已 (c(p,9q)) can be written in such a form, and a comparison 
of the coefficients yields the coefficients c(7) of the P-series 已 (c, (p,9)) in arecur- 
Sive manner. We again have that P(c, (p,9q)) is O(j7) close to the identity, 这 the 
perturbation in (8.1l) is of size 〇 (PR7). 














The statement of Theorem 8.8 remains true in the class of second order differ- 
ential equations 4 = 万 (四 ,ie.,D = 万 ( 站 ,dd=2. 


VI.9 Volume Preservation 


The How wr of a Hamiltonian System Preserves volume in phase Space: for every 
bounded open set P C 陈 2d and for every ti for which ov(y) exists for all ye 2， 


vol(or(D)) = vol(D) ， 
where vol(O) = 几 dy This identity is often referred to as Ziormille's tpeorem. 
is a conseduence of the transformation formula for integrals and the fact that 


人 由 (9.D) 


0 
which follows directly from the Symplecticity and po 三 id. The Same argument 
Shows that every Symplectic transformation, and in particular every Symplectic in- 
tegrator applied to a Hamiltonian System, preserves Volume in phase Space. 
Meore generally than for Hamiltonian Systems, volume ls preserved by the fow 
of differential equations with a divergence-free vector field: 
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Lemma 9.1. 77jpe Jow ofa dizerenfialequation yy 三 UV) 太阳 "ivoluare-preserving 
tard oniy if divFy) 三 0 Joral11y. 


Pioof The derivative 节食 三 2 (yo) is the solution of the variational equation 





立 = 4(07， Y(0) = 了 


with the Jacobian matrix 4 的 = 万 (yy 的 ) at y 提 = wx(yo). From the proof of 
Lemma IV.3.1 we obtain the 4pe 广 Liouyie-/acop-Ostjrog7adsKii Ideatity 


d 

机 det 下 一 trace 4( 轨 .det 素 (9.2) 
Note that here trace 4 介 = divjy(). Hence, dety 轨 王 1foralt 这 andonly 让 
divj(y 扩 ) = 0foralt Since this is valid for all choices of initial values yo, the 
TeSult follows， 














了 xample 9.2 (ABC Flow). This fow, named after the three independent authors 
Arnold, Beltrami and Childress, is given by the equations 


之 一 4sinz 十 Cocosy 
1 一 Bsinz 十 4cosz (9.3) 
2 一 Csiny 十 已 cosz 





and has all diagonal elements of 放 identically zero. It is therefore volume preserv- 
ing. In Arnold (1966, p.347) it appeared in a footnote as an example of a fow with 
rot 太 parallel to 太 thus violating Arnold's condition for the existence of invariant 
tori (Arnold 1966, p.340). It was therefore expected to possess interesting chaotic 
properties and has since then been the object of many investigations Showing their 
non-integrability (See e.g., Ziglin (1996)). We jllustrate in Fig.9.1 the action of this 
flow by transforming, in a volume preserving manner, a ball in 了 3. We see that， 
VerIy Soon, the Set ls Strongly squeezed in one direction and dilated in two others. 
The solutions thus depend in a very Sensltive way on the initial values. 


Volume-Preserving Numerical Integrators. The question arises as to whether 
Volume-preserving integrators can be constructed for every differential equation 
with volume-preserving fow. Already for linear problems，Lemma IV.3.2 Shows 
that no standard method can be volume-preserving for dimension 风 _ > 3. Never- 
theless, positive anSwers were found by Qin 儿 Zhu (1993), Shang (1994a,1994b)， 
Feng & Shang (1995) and Quispel (1995). In the following we present the approach 
of Feng 贸 Shang (19935). The key is the following result which generalizes and 
reinterprets a construction of H. Weyl (1940) for 7 = 3. 


Theorem 9.3 (Feng & Shang 199S). Every diverge1ice-jree vector jield 厂 : 了 ”一 
时” ca1z pe wW1itte1n ay 1Pe Su1a ofm 一 1 vector jields 


三 = 万 2 十 户 3 十 .十 万 -un 


VI9 Volume Preservation 229 





Fig. 9.1. Volume preserving deformation of the ball of radius 1, centred at the origin, by the 
ABCfow;4=1/2 巨 =C=1 


Wje1e edc 大 DNS aailiomian 太 加 e variaples (WUVKT1): tere exXist jiactio1as 
古 RH1 疏 ” 一 有 SucP tat 


OFKEHL OHRRH1 














一 (0,...，,0， ,0, .0)7 . 
1 OUVk+1 OU ) 
Proof In terms of the components of 三 = (万 … 太 ), the functions 五 :b+1l 
Imust Satisfy the equations 
OF1> DFI 2 OFo2 3 
万 三 一 )》 户 = 全 
Oy2 OU Dys 
DO 万 2m 1 OF ln DO 万 1n 
| 一 ) 万 TEST 
Dyn_2 DOyn DOyn 1 
We thus set 
V2 
2 As 万 dv2 
0 


andfor 有 三 2),...) 妈 一 2 


+17DB 1 
的 | 一 一 -一 一 人 C 和 
KE 二 1 ( 访 ) VK+1 
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It remains to construct 妃 7 -1m from the last two equations. We see by induction 
that for 上 六 刀 一 2， 





D2 有 pH1 二 十 1 | 
OKOWK+1 0 ”8 





and hence the integrability condition for 妃 ” 1m， 





人 


reduces to the condition divj 三 0, which is satisfited by assumption. 妃 p 1nm can 
thus be constructed as 


2 DO 万， 和 Zn 一 1 
尖 六 二 站 六 一 小 (2 写 户 -1) aoyn + 亡 | 人 =0 dyn -1 ) 
0 Zn 一 2 0 











which completes the proof. 





The above construction also Shows that 


大 H1 二 (0, ,0, 太 四 9k; 一 0k+1; 0 .0) 


VR 二 1 O 广 0 太 
可 响 陋 一 :| 0 PR 地 
3 人 


for1 冬 下拉 允 一 2,andol = 三 0ando = 三 一 万 . 

With the decomposition of Lemma 9.3 at hand, a volume-preserving algorithm 
is obtained by applying a splitting method with symplectic Substeps. For example， 
as proposed by Feng 儿 Shang (1993), a second-order volume-preserving method is 
obtained by Strang splitting with symplectic Euler Substeps: 


WwWith 








灿 ) 克 | 水 亿 一 工 ,7| 亿 一 1 人 于: 伺 
2 人 o 国 0 
where 而 + isa Symplectic Euler step of length 岂 /2 applied to the system with 


Tight-hand side 刀 k+1, and * denotes the adjoint method. In this method, one Step 
攻 = 2 人 (yy is computed component-wise, in a Gauss-Seidel-like manner, as 


本 彤 ，， 
页 三 全 十 本 万 ( 页 92， 





四 万 四 灵 

亚 二 铸 十 三 大 (页 9 丈 :名 4) 十 训 儿 做 for 天 一 20 一 1 
加 尹 。 二 

殉 三 名 十 可 记 ( 全 下 ) (9.4) 


with 所 | 钛 二 9k( 玉 ， En ) VE， WE 十 1 和 ,np) 2 9k(Z1， SA )， VE 1 VE， ,yn )， and 
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一 、 


本 3 
磷 三 到 十 了 刀 ( 页 加) 





0 及， ES 汪 刀 

炙 二 丈 二 三 大 (页 天 亦 本 加) 一 了 丈 坊 for 玉 三 多 一 二 .2 
过 ER 人 

页 三 页 十 训 广 (页 , 罗 2) (9.3) 


With 玖 六 三 钛 ( 天 -区 ) 一 外 (下 ) The 
method is one-dimensionally implicit in general,but becomes explicit in the par- 
ticular case where 9 及 /6 人 = 0forall 有 K. 


Separable Partitioned Systems. For problems of the form 


= /2)， :一 0) (9.6) 


withy E 陈 也 ,2 E 玉 ",the scheme (9.4) becomes the symplectic Euler method, (9.3) 
its adjoint, and its composition the Lobatto IIUA - IIUB extension of the Stormer 一 
Verlet method. Since Symplectic explicit partitioned Runge-Kutta methods are com- 
positions of Symplectic Euler steps (Theorem VI.4.7), this observation proves that 
Such methods are volume-preserving for Systems (9.6). This fact was obtained by 
Suris (1996) by a direct calculation， without interpreting the methods as composi- 
tion methods. The question arises as to whether more Symplectic partitioned Runge 一 
Kutta methods are volume-preserving for Systems (9.0). 


Theorem 9.4. Every 8Sy11Plectic Rage-KU1ta 1Ietpod with at 11081 1yO 8S1a8ey 1 
VOLU111e-P1ese1Vi18 Jorsyste118 (9.6) of arpitrary Qi7ie118io1. 


Poojf (al) The idea is to consider the Hamiltonian System with 


吾 (w 凡 本 凤 7 十 玉 g()， 
Where (Wu) are the conjugate variables to (y, >). This system is of the form 


1 = jz) 习 一 一 9( 轨 7 


(9.7) 
2 = 9( 切 六 三 一 六 (z)7 


Applying the Runge-Kutta method to this augmented System does not change the 
numerical solution for (y, 2). For symplectic methods the matrix 


(Wi 21 UVU1) E 玉 0 
|) 人 | 


satisfies MT .NM = J which implies RT7 = 二 Below we shall show that det 人 一 
det 尺 . This yields det 尺 = 1 which implies that the method is volume preserving， 

(pb) O1e-slage 111ethods. The only Symplectic one-stage method is the implicit 
midpoint rule for which 尺 and 7 are computed as 
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(=-: 囊 )R 到 7 十 琴 (9.9) 


(+3 三 )7 TI- 下， (9.10) 


Where El is the Jacobian of the System (9.6) evaluated at the internal stage value. 


Since ( ) 
0 洒 2&1/2 】 
五 1 一 
1 0 


a Similarity transformation with the matrix 刀 = diag(7, 一 门 takes Pi to 一 古 . 
Hence, the transformed matrix satisfies 


(到 )(D-7D) = 了 + 下 ， 


A comparison with (9.9) and the use of det X7 = det X proves det 尽 = detT for 
the midpoint rule. 
(c) Tipo-stase 11etjpods. Applying a two-stage implicit Runge-Kutta method to 


(9.7) yields 
了 一 Aaill 五 1 一 六 al2 瑟 > 情 :1 有 了 
一 Ana21 五 1 了 一 Pao22 瑟 > 卫 > 了 
where Ri is the derivative of the (y, z) components of the ;th stage with respect to 


(yo, z0), and 忆 ; is the Jacobian of the System (9.6) evaluated at the ;th internal stage 
value. From the solution of this System the derivative 尺 of (9.8) ls obtained as 


了 一 Aa11 五 1 一 Anal2 一 > | 的 


尺 三 了 +(0EbEo) ( 一 pao2l 有 II 了 一 Paoo 忆 ， 了 


With the determinant identity 


忆 江 本 
det(D) det(X 一 环 0 站) = det 克 


凡 = detl(X) det(7 一 YX T)， 


which is seen by Gaussian elimination, this yields 


det(T@T 一 AN4- 世 站)@DD 


ee 





where 4 and bp collect the Runge-Kutta coefficients, and 成 = blockdqiag ( 己 1 , 已 2?). 
For D-17D we get the same formula with 妃 replaced by ET.If4is an arbitrary 
2 x2matrix, it follows from block Gaussian elimination that 


det(T@T 一 HA4g@D 站 下 I = det(T@T 一 An4Q@D 到)， (9.11) 


which then proves det 尺 = det 了. Notice that the identity (9.11) is no longer true 
in general 让 4is of dimension larger than two. 
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Fig. 9.2. Volume preservation of Gauss methods applied to (9.12) with 彤 三 0.8 


We are curious to See whether Theorem 9.4remains valid for symplectic Runge 一 
Kutta methods with more than two Stages. For this we apply the Gauss methods with 
s 一 2ands= 王 3tothe problem 


也 一 Sin 2 2 一 COS 2 2 一 Siny 十 COS2Z (9.12) 


with initial value (0,0,0). We show in Fig.9.2 the determinant of the derivative of 
the numerical fow as a function of ttme. Only the two-stage method is volume- 
preserving for this problem which is in agreement with Theorem 9.4. 


VI.10 Exercises 


和 
1. Let a and be the generalized coordinates of the douple 
pendulum, whose kinetic and potential energies are QNL1 
21 ，. ; 7722 ，. ; 
汉 :二 术 ( 阅 十 办) 十 (人 2 十 提 ) 17 
L = 719V1 十 7029V2. DA 
Determine the generalized momenta of the correspond- 
7722 


ing Hamiltonian System. 


2. A non-autonomous Hamiltonian System is given by a titme-dependent Hamil- 
tonian function 达 (p,d, 轨 and the differential equations 


咏 = 一 五 v(D, 9 加 ， = 万 p(P,9, 加 . 


Verify that these equations together with e = 一 右 (p,q 为 and t = 1 arethe 
canonical equations for the extended Hamiltonian 万 (六 0) = 瑟 (p,d 加 十 e 
with 万 = (pp,e) andd = (d 放 ， 

3. Prove that a linear transformation 4 : 下 2 一 月? is symplectic, if and only 让 
det 4 = |. 

4. Consider the transformation (7, 2) Fr* (Pp,9q)j, defined by 


D=VWr)cosp， 9=W(r)sinw， 


For which function wm(r) is it a symplectic transformation? 
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Prove that the definition (2.4) of 2(AM) does not depend on the parametrization 
,le., the parametrization wm = 2 o Q, where wa ls a diffeomorphism between 
suitable domains of 及 2, leads to the same result. 


. OnthesetV = {(po);22 二 092 > 0} consider the differential equation 





态 1 也 
二 5 10.1 
加 D2 十 02? (1) ee 
Prove that 


aj its How is Symplectic everywhere on UL ; 

b) on every Simply-connected Subset of Z the vector field (10.1J) is Hamiltonian 
(with 达 (p,9) = 一 Im log(p 十 19) 十 Comsb); 

c) it is not possible to find a differentiable function 囊 : Z 一 疏 suchthat(10.1) 
is equal to .J 一 1LV 万 (pq) forall (po e 忆 7. 

Re1Ia1K The vector field (10.1) is locally (but not globally) Hamiltonian. 


. (Burnton 儿 Scherer 1998). Prove that all members of the one-parameter family 


of Nystrom methods of order 2s, constructed in Exercise IIU.9, are Symplectic 
and Symmetric. 


.Prove that the statement of Lemma 4.1 remains true for methods that are for- 


mally deftined by a B-series，@j(V) = 已 (a, JJ)， 


.Compnute the generating function 5S1(P, ,7 of a symplectic Nystrom method 


applied to 4 = 7 (q). 

Find the Hamilton-Jacobi equation (cf. Theorem 3.7) for the generating func- 
tion 952(D),Q) of Lemma 5.3. 

(Jacopis1retpodjporexactintegratiozz). Suppose wehaveasolution 9(qQ ,ta) 
of the Hamilton-Jacobi equation (3.16), depending on wparameters al ,. ,ad 
Such that the matrix 人 了 -】 ls invertible. Since thlis matrix 1ls the Jacobian 
of the System 








D9 
一 0 人 二 5 (10.2) 
Oai; 
this System determines a solution path 1 ...，,Qv which is locally unique. In 


possession of an additional parameter (and, including the partial derivatives 
with respect to 坊 an additional row and column in the Hessian matrix condi- 
tion)，we can also determine @i ( 轨 as function of 上 Apply this method to the 
Kepler problem (1I.2.2) in polar coordinates，where，with the generalized mo- 
menta Dr 一 7, Do 二 r2o, the Hamiltonian becomes 


1 1 
于 训 (人 避 直 区 
汪 





2 亿 
and the Hamilton-Jacobi differential equation (3.16) ls 
工 2 工 2 AI 
3 十 了 (Sr) 十 rz(3o) = 二 = 


Solve this equation by the ansatz 5(t mm P) = 01 (人 轨 十 02( 门 十 93(P) (Separation 
of variables). 


12. 


13. 


14. 
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Resu 人 One obtains 





dl 
5= /vaar2+2Mr 一 号 宁 +oap 一 at 
人 


Mr 一 a2 
an elementary integral. This, when resolved fory, leads to the elliptic movement 
of Kepler (Sect.I.2.2). This method turned out to be most effective for the exact 
integration of difGcult problems. With the Same ideas, just more complicated 
in the computations, Jacobi Solves in “lectures” 24 through 30 of (Jacobi 1842) 
the Kepler motion in 及 3, the geodesics of ellipsoids (his greatest triumph), the 
Imotion with two centres of gravity, and proves a theorem of Abel. 
(Chams ZLopatio 1114 1etpods.) Show that there exists a one-parameter family 
of symplectic, symmetric (and 4-stable) Runge-Kutta methods of order 2s 一 2 
based on Lobatto quadrature (Chan 1990). A Special case of these methods can 
be obtained by taking the arithmetic mean of the Lobatto JIA and Lobatto IJB 
method coefficients (Sun 2000). 
ZL Use the 帮 -transformation (See Hairer 色 Wanner (1996),p.77) by putting 
Ass_ 1 三 一 Xs。 lsanarbitrary constant. 
For a Hamiltonian system with associated Lagrangian Z(aq,d) 三 307 AM d 一 
U(d, show that every first integral T(D, q) = Da(d) resulting from Noether's 
Theorem has a linear a(dq) = 4d + c with skew-symmetric M 4. 
EL (a) Itls sufficlient to consider the case AM = 工 
(b) Show that a' (9) is skew-symmetric. 
(c) Let aij(d) 三 5 (q). Using the symmetry of the Hessian of each compo- 


Putting, e.g., 05/0as = 0, we obtain p = arcsin by evaluating 





nent ai(9q), Show that aij(9) does not depend on qi qj, and is at most linear in 
the remaining components dk. With the skew-symmetry of ao'(q), conclude that 
Qa' (9) = Comst. 

Consider the unconstrained op1i7GL com1yol Prople11 


Cl(dq( 站 ) 一 min 
4 的 一 jg 人 )， 4(0) =@ 


on the interval [0,7], where the control function is assumed to be continuous. 
Prove that first-order necessary optimality conditions can be written as 


4 = Vp 吾 (pb db 人)， q(0) 二 g 
2 = -VDub)， DPI) = YoclD) 004) 
Vs 如 CC db 人 的)， 


where the Hamiltonian ls given by 


刀 (Pg 三 夺 F0o) 


(we assume that the Hessian V2 万 (p, qu is invertible, so that the third relation 
of (10.4) defines w as a function of (D, 9)). 


(10.3) 


避 
| 


230 


15. 
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恺 AL Consider a slightly perturbed control function v( 方 十 sou( 轨 ，and let 
d( 四 二 sbd( 轨 十 O(zs2) bethe corresponding solution ofthe differential equation 
in (10.3). With the function p( 轨 of (10.4) we then have 


下 汪 
C'(e(m)5aCm) = 下 (praog)jd= 下 pz 的 7 记 (. .5u(bd 


The algebraic relation of (10.4) then follows from the fundamental lemma of 
variational calculus. 
ARunge-Kutta discretization of the problem (10.3) is 


OH 一 加 十 几 开 :- bi7(Qu TD) 
Q@mi 二 舌 十 尹 27-1057Qn Do) 


with 风 三 0 .YY 一 1and 岂 = 了 T/N.Weassumeu 天 0foral;i Introducing 
Suitable Lagrange multipliers for the constrained minimization problem (10.3)， 
prove that there exist pn, Zi Such that the optimal solution of (10.3) satisfies 
(Hasger 2000) 


dm+l 二 dmn 下 灵 》 ;1 UVp 古 (Pi Cni， Vi) 
Q@mz = gm 十 太 2 1057Vp 厂 (Po ni Da 
Po = 一 六 DVPoQay Da (10.0) 
Pa = 和 一 疙 2 六 1057Y9 LPGOn7 Da 
0 三 VPR Goni Dam) 











with pw = VC(qNw) and given initial value go，where the coefficients 了 and 
ij are determined by 


DZ 


六 一 Di DG 十 机 oj 态 2 (10.7) 


Consequently, (10.6) can be considered as a Symplectic discretization of (10.4); 
See Bonnans & Laurent-Varin (2000). 

(Hager 2000). For an explicit s-stage Runge-Kutta method of orderp 一 s and 
天 0, consider the partitioned Runge-Kutta method with additional coeffi- 
clients 访 and Qij defined by (10.7). Prove the following: 

al) ForDp 三 s 三 3,thepartitioned method is of order 3 寺 and only if cs = |. 

b) For2p = ss 三 4,thepartitioned method is of order 4 without any restriction. 


Chapter VII. 
Non-Canonical Hamiltonian Systems 


We discuss theoretical properties and the Structure-preserving numerical treatment 
of Hamiltonian Systems on manifolds and of the closely related class of Poisson 
Systems. We present numerical integrators for problems from classical and quantum 
mechanics. 


VIILI Constrained Mechanical Systems 


Constrained mechanical Systems form an Important class of differential equations 
on manifolds. Their numerical treatment has been extensively investigated in the 
context of dijjerem1ial-algepraic eqguatio1ns and is documented in monographs like 
that of Brenan, Campbell & Petzold (1996), Eich-Soellner & Fuhrer (1998), Hairer， 
Lubich &Roche (1989), and Chap.VII of Hairer 儿 Wanner (1996). We concentrate 
here on the Symmetry and/or Symplecticity of such numerical integrators. 


VIL.1.1 Introduction and Examples 

Consider a mechanical System described by position coordinates 091,...,qd,， and 
Suppose that the motion is constrained to satisfy g(d) = 0 where 9 : 玉 4 一 及 m with 
mr < d. LetT(a,d) = 507M(Og be the kinetic energy of the system and UV(a) its 
potential energy, and put 


7 人 =T( 人 一 VOD 一 9(O 和， (G.D 
where 入 一 (和 ...,Xm)7 consists of Lagrange multipliers. The Euler-Lagrange 
equation of the variational problem for 万 Z(doa) at is then given by 

Q 王 ) O7 0 
dd ar 


Written as a first order differential equation we get 


0 人 
MO = oo 一 GD》 (1.2) 
0 = 990)， 
where fwo) = - 交 (M(Oojo+YVoT(o) -Ver(g) and GO = 六 (0 
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了 上 xample 1.1 (Spbherical Pendulum). We denote by qil, qz, ga the Cartesian coor- 
dinates of a point with mass 7 that is connected with a massless rod of length L 
to the origin. The kinetic and potential energies are 全 一 轰 ( 妈 十 绽 十 好 ) and 
LU = 9q3 , TeSpectively, and the constraint ls the fxed length of the rod. We thus 
get the System 


和 三 Vi moO1l 一 一 201 入 

7 人 7 人 一 一 2 入 

睹 2 d2 (1.3) 
da 三 V3 03 三 一 729g 一 293 入 


0 = 邓 十 全 十 吕 一 人 妈 . 


The physical meaning of Ais the tension in the rod which maintains the constant 
distance of the mass point from the origin. 


xistence and Uniqueness of the Solution.。 A standard approach for studying 
the existence of solutions of differential-algebraic equations is to differentiate the 
constraints until an ordinary differential equation is obtained. Differentiating the 
constraint in (1.2) twice with respect to time yields 


0=G(q)v and 0=9g (ou 十 G(OY (1.4) 


The equation for v in (1.2) together with the Second relation of (1.4) constitute a 
linear System for w and 入 ， 


CO 


Throughout this chapter we require the matrix appearing in (1.5) to be invertible for 
d close to the Solution we are looking for This then allows us to express v and 入 as 
functions of (9 由 ). Notice that the matrix in (1.5) is invertible when G(d) has full 
rank and M(9) is invertible on ker G(q) = { 公 | G(q) 玉 = 01. 

We are now able to discuss the existence of a_ solution of (1.2). First of all,， 
observe that the initial values go, vo, Xo cannot be arbitrarily chosen. They have to 
satisfy the first relation of (1.4) and Xo = X(qo,v0),， where 和 (qd,v) is obtained from 
(1.5). In the case that go,Vvo, Xo Satisfy these conditions，we call them co7zsiste7zt 
17011QL vaLues. Furthermore, every Solution of (1.2) has to Satisfy 


4 三 V， 孜 二 DG9U)， (1.0) 


where D(q,v) is the function obtained from (1.5). It is known from standard theory 
of ordinary differential equations that (1.6) has locally a unique solution. This solu- 
tion (g( 轨 ,人 () together with 和 A( 昌 := (qd( 轨 ,uv( 轨 ) satisfies (1.5) by construction， 
and hence also the two differential equations of (1.2). Integrating the second relation 
of (1.4) twice and using the fact that the integration constants vanish for consistent 
initial values, proves also the remaining relation 0 = 9(d) for this solution. 
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Formulation as a Differential Equation on a Manifold. We denote by 


o={faog=0 (7) 


the comjisgxyatiom 17011 加 1 on which the positions qd are constrained to lie. The 
tangent Space atd E @is7T@ = {uv;i G(q)u = 0}. The equations (1.6) define thus 
adifferential equation on the manifold 


7T@={(oidsevs79={oio0)=0GOu=0)， (41.3) 


the lazge1t pu1adle of @. Indeed, we have just shown that for initial values (aqo, vo) E 
7G@ (ie., consistent initial values) the problems (1.6) and (1.2) are equivalent, so that 
the solutions of (1.6) stay on 了 @. 


Reversibility。 The System (1.2) and the corresponding differential equation (1.60) 
are Ieversible with respect to the involution p(q,v) = (qd 一 路 , 让 Fa 一 v) = 三 
dv). This follows at once from Example V.1.3, because the solution (aq,v) of 
(1.5) satisfies (aq, 一 v) = (d,V) 


For the numerical Solution of differential-algebraic equations “index reduction ” 
is a very popular technique. This means that instead of directly treating the prob- 
lem (1.2) one numerically solves the differential equation (1.6) on the manifold 
A 人 4. Projection methods (Sect.IV.4) as well as methods based on local coordinates 
(Sect.IV.S) are much in use. I one ls interested in a correct Simulation of the re- 
versible Structure of the Problem, the Symmetric methods of Sect.V.4 can be ap- 
plied. Here we do not repeat these approaches for this particular situation, instead 
we concentrate on the Symplectic integration of constrained Systems. 


VII.1.2 Hamiltonian Formulation 


In Sect.VI.1 we have seen that, for unconstrained mechanical Systems, the equa- 
tions of motion become more Structured 让 we use the momentum coordinates 
7 三 细 = M(ddin place of the velocity coordinates v = 4. Let us do the Same 
for the constrained System (1.2). As in the proof of Theorem VI.1.3 we obtain the 
equivalent System 


二 古 po,9) 
六 = 一 Bo(Dg) 一 G(O)7 (1.9) 
0 = 99)， 
Where 
瑟 2g) = 了 MO +U(9) (1.10) 


is the total energy of the system; 妃 p and 鼠 。 denote the column vectors of partial 
derivatives. Differentiating the constraint in (1.9) twice With respect to time,， we get 


由 二 和 (GOBO 四 ) 杞 9) 一 G(OEop(D,9) ( 束 @， 中 十 G(OTA) (12) 
4 
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and assuming the matriX 
G(g) 古 (aq)G( ID) isinvertible， (1.13) 


equation (1.12) permits us to express 和 A in terms of (2, 9). 


Formulation as a Differential Equation on a Manifold. Inserting the so-obtained 
function A(p,dq) into (1.9) gives a differential equation for (2, 9) on the manifold 


AM ={0);i9(0) =0 G(O)B(9) =0} (1.14) 


As we will now see, this manifold has a differential-geometric interpretation as 
the cotangent bundle of the configuration manifold @ = {d; 9(q) = 0}. The 
Lagrangian for a fixed g E Q@is afunction on the tangent Space 7T5@，i.e.， 
了 (9 ) :7T5@ 一 下 .Its (FrEcheb derivative evaluated at d < 7 @ is therefore a lin- 
ear mapping djZ(d,d) :7TQ@ 一 了 ,or in other terms, qdZL(dq,d) is in the cotangent 
space 77 @. Since the duality is such that (ds 世 (q, 0),u) 三 人 劳 (， dvuforve7Tr@， 
condition (1.13) ensures that the Legendre transform ad 尽 2D 三 do is an in- 
vertible transformation between TvQ@ and 了 7 Q@. We can therefore consider 2 Q as 
a Subspace of 取 4 这 every PE 7T7@ is identified with 劳 (4， dg = MooeRdfor 
the unique d E TvQ@ for which p = djZ(q,9) holds. With this identification， 


Tr@={AO0 ETe， 
and the duality is given by (p,vw) = p7u for p E 7T7Q@ andv Ee 7 包 . We thus have 


六 =Moe 人 Qifandonyif= MPp= 古 (pq ET@.Since the 
tangent spaceatdE @ is 7T9@ = {d;i G(qd = 0} we obtain that 


DET7O 让 and only 让 G(g) 瓦 (D,9) = 0. 


Denoting by 大 @ = {9q)i9ese@pe 人 Qthecotanrsgent pudle of @, we 
thus see that the constraint manifold 人 of (1.14) equals 


AM =7Q. (1.15) 


The constrained Hamiltonian System (1.9) with Hamiltonian (1.10) can thus be 
viewed as a differential equation on the cotangent bundle 7 @ of the configura- 
tion manifold Q@. 


In the following we consider the System (1.9)-(1.12) with (1.13) where 互 (P, 9g) 
is an arbitrary Smooth function. The constraint manifold is then still given by (1.14). 
The existence and uniqueness of the solution of (1.9) can be discussed as before. 


Reversibility， It is readily checked that the system (1.9) is reversible 旋 (一 D, 9) 王 
刀 (p,q). This is always satisfied for a Hamiltonian (1.10). 


Preservation of the Hamiltonian。 Differentiation of 刀 (z( 轨 ,9( 坟 ) with respect to 
time yields 
7 Tm 了 
-BT 机 一 FTGTA+BTB 
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with all expressions evaluated at (p(b,d( 人 四). The first and the last terms cancel， 
and the central term vanishes because CDp = 0 on the solution manifold. Conse- 
quently, the Hamiltonian 瑟 (2, 9q) is constant along solutions of (1.9). 


Symplecticity of the Flow. Since the fow of the System (1.9) is a transformation 
on A4, its derivative is a mapping between the corresponding tangent Spaces. In 
agreement with Definition VI.2.2 we call a map 2 : A4 一 人 Symplectic 刻 for 
every2 = (D,q) E 人 4， 


co(zj7yoz 和 = 好 Je foral ee 人 人 (1.16) 


If o is actually defined and continuously differentiable in an open Subset of 束 24 
that contains 人 , then 2'(Zz) in the above formula is just the usual Jacobian matrix. 
Otherwise, Some care is necessary in the interpretation of (1.16): P' is the tangent 
map given by the directional derivative 2'(z)5 := (d/dr)l>-o 2(3y(r)) fore E 
TrA, whereyisapath on 人 M with 7(0) = z,3(0) = 和 .The expression 上 ol(z) 
in (1.16) should then be interpreted as (2'(z)6) 


Theorem 1.2. Zef 克 (P,dq) and 9g(a) pe twice comtiatosiy dierentiaple. 7Tjpe Jom 
oO A 人 一 人 (ofijesyyste1a (1.9) 08 1Pe1z asSy1iD1ectic 1ra118s1o7raatio1a on 人 1 Le 这 
SG1is1jresy (7.70). 


Piroof We letz = (2D,g), so that the system (1.9) becomes 之 = .JI(V 感 (z) 十 
> ;Xi(z)Vgi(z))，where Xi(z) and gi(z) are the components of X(Z) and g(z)， 
and A(Z) is the function obtained from (1.12). The variational equation of this syS- 
tem, Satisfied by the directional derivative 到 = pit(Zo)6, with zo = (Po, 9o), reads 


旋 = , 广 :1 (Yo) + No)v2gio+》， Vy(z)YX(nz7) 克 


2 一 1 


A direct computation, analogous to that in the proof of Theorem VIL.2.4, yields for 
6 E To 人 4 
忆 7 
(Soleoryeteoe) = = Eveorvo)YNJrettooe 
一 1 


- > 6ol(zoj7VAi(z)Voi(z)7eol(zo)sa. (1.17) 
3 


Since gi(et(zo)) = 0 for zo Ee AM, we have Vgi(z)7ef(zojla = 0 and the same 
for 上 il, so that the expression in (1.17) vanishes. This proves the Symplecticity of the 
fow on 人 4. 














Differentiating the constraint in (1.9) twice and solving for the Lagrange multi- 
plier from (1.12) (this procedure ls known as “index reduction ”of the differential- 
algebraic System) yields the differential equation 


4= 囊 (9)， 方 = 一 本 (pg9) 一 G(O7 AP,9)， (1.18) 
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.0002 














一 .0002 


Fig. 1.1. Numerical solution of the symplectic Euler method applied to (1.18) with 瑟 (p, 9) 三 
3(D1 十 02 十 D3) 十 qa,g(q) = 十 吗 十 9 一 1(Gpherical pendulum); initial value ao 一 
《0, sin(0.1), 一 cos(0.1)), po = (0.06,0, 0), step size 几 = 0.003 for method“SE”(without 
projection) and 妨 = 0.03 for method “SEproj”(with projection) 








Where 和 (D, 9) is obtained from (1.12). 开 we solve this system with the symplectic 
Euler method (implicit in p, explicit in g), the qualitative behaviour of the numeri- 
cal Solution is not correct. As was observed by Leimkuhler 儿 Reich (1994), there 
is a linear error growth in the Hamiltonian and also a drift from the manifold A4 
(method “SE” in Fig. 1.1). The explanation for this behaviour is the fact that (1.18) 
is no longer a Hamiltonian System. I we combine the Symplectic Euler applied 
to (1.18) with an orthogonal projection onto 人 A4 (method“SEproj), the result im- 
proves conslderably but the linear error growth in the Hamiltonian is not elliminated. 
This numerical experiment illustrates that “index reduction” is not compatible with 
Symplectic integration. 


VJIL1.3 A Symplectic First Order Method 


We extend the Symplectic Euler method to Hamiltonian Systems with constraints. 
We integrate the Dp-variable by the implicit and the 9-variable by the explicit Euler 
method. This gives 


区 +1 二 Dp 一 用 ( 瑟 5( 轴 + dn) 计 G(dn)7Xn+Hl) 
dn+l 三 qdn 十 尺 五 (PHl， dn) (1.19) 
0 = 9(dn+1). 





The numerical approximation (Z+1, an+1) Satisfies the constraint 9g(dq) = 0, but 
not G(9q) 瑟 (pp,9) = 0. To get an approximation (pn+l;,qn+1) < 人 ,we append 
the projection 
mp+Ll 一 有 +1 二 挛 G(qnH)7 n+1l 
0 = Gan+1)p(Dn+lgn+1). 


Let us discuss Some basic properties of this method. 


(1.20) 


了 xistence and Uniqueness of the Numerical Solution。. Inserting the deftinition 
of qn+l from the second line of (1.19) into 0 = 9(an+1) gives a nonlinear System 
for 罗 +1 and 刀 Xn+1. Due to the factor 屎 in front of 刀 p( 思 +1,dn)， the implicit 
function theorem cannot be directly applied to prove existence and uniqueness of 
the numerical Solution. We therefore write this equation as 
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0 = 9g(qn+1) = 9g(dn) 下 G(w 二 7(dan+l 一 om))(orl 一 an)a7. 


We now use 9g(dn) = 0, insert the deftinition of qd +1 from the second line of 
(1.19) and divide by 几 . Together with the first line of (1.19) this yields the System 
环 (页 + An 请 ) 一 0 with 


嘱 一 因 十 PP 十 G(on)7z 


乒 》 用 一 4 
人 三 co + 本 weg) 本 wo)dr 
0 


Since (pn,dn) E 人 with 人 from (1.14), we have 玉 (pn,0,0) = 0. Furthermore， 


Or 本 了 G(o 并 
了 林 必 ,09 一 人 0 】 


and this matrix is invertible by (1.13). Consequently, an application of the implicit 
function theorem proves that the numerical solution (及 ,+1, 六 An+i) (and hence also 
dn+l) exists and is locally unique for sufficiently small 中. 

The projection Step (1.20) constitutes a nonlinear System for pn+1 and PNLn+1l， 
to which the implicit function theorem can be directly applied. 


Convergence of Order 1. The above use of the implicit function theorem yields 
the rough estimates 


有 Hpn+OOJ AH 三 O)， Puntl=O(A)， 


which, together with the equations (1.19) and (1.20), give 


qdnH1=d(t)TTOO2，pn+l=ptntHi) 一 GottnD) zx 十 O()， 


where (p(b),d 人 的 ) is the solution of (1.9) passing through (pn gn) <e AM att = 如: 
JInserting these relations into the Second equation of (1.20) we get 


0= GoO) 肋 (pt0,aG0) +G(eO) 本 (eaG)G(eD) +O() 


at 上 一 如 4H1.Since G(dO) 忆 (pb,d 的 ) = 0,itfollows from (1.13) that 7” 一 
O(12). The local error is therefore of size O(]2). 

The convergence proof now follows Standard arguments, because the method is 
a mapping 2 : 人 4 一 人 on the solution manifold. We consider the Solutions 
(pn(b,gqn 信 ) of (1.9) passing through the numerical values (pn gn) < AM att 一 
娓 ,we estimate the difference of two Successlve Solutions in terms of the local error 
at 如 ，and we Sum Up the propagated errors (See Fig.3.2 of Sect.I.3 in Hairer， 
Ngrsett & Wanner (1993)). This proves that the global error satisfies pn 一 D( 恕 ) 三 
O(P) and om 一 dg( 妇 ) 三 O(P) aslong as 如 三 7 下 所 Comst. 
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Symplecticity。， We first study the mapping (pn,dn) 呈 ( 芭 +1,dn+i) defined by 
(1.19), and we consider Xn+1 as a function A(Pu, an). Differentiation with respect 


to (pn,qn) yields 


了 十 彤 万 也 0 ODN (TI-pnGTAX 5 一 pnG7Xs (1.21) 

一 hnDpp 了 工 (pn, qdn) 0 7 二 AD /” ” 
where 9 = 一 AT 一 AZ god is a Symmetric matrix, the exXpressions 五 wo, 刀 pm， 
丽 v G are evaluated at (DT1,gqn), and 和 Ap, 》Xo at (pn qn). Acomputation, iden- 
tical to that of the proof of Theorem VI.3.3, yields 


人 人 _ 0 了 一 PAEG 
OUn， dn) OUn， dn) 一 了 十 PG7 Xp PC(G7 人 X3G) 





We multiply this relation from the left by 41 E 7T(pwa) 人 W and from the right by 
6 <E 7Tp。om)At. With the partitioning & 一 人 Spy) 2 have G(qu)co = 0 for 
了 = 12 so that the expression reduces to 上 Je. This proves the symplecticity 
ii (1.16) for the mapping (pn,dn) 上 (Pidn+i) 

Similarly, the projection step (Za mm (pn+ridn+li) of(1.20) gives 


D(Dn+1 dn+1) 2 ( 有 PGYJ 和 一 
D(ZH1， dn+1) 0 了 2 








Where Hn+l of (1.20) is considered as a function of (Z+1dn+l)，and 9 = 
一 Pd goa， This is formally the Same as (1.21) with 媚 三 0. Consequently, the 
Symplecticity condition ls also Satisfied for this mapping. As a composition of two 
Symplectic transformations, the numerical fow of our first order method is therefore 
also Symplectic. 


1.0 
component qs Symplectic Euler 





1 1 | 
50 100 


component 
.5 人 implicit Euler 








上 上 上 上 | 上 上 上 上 | 
50 100 
Fig. 1.2.。 Spherical pendulum problem solved with the Symplectic Euler method (1.19)- 
(1.20) and with the implicit Euler method; initial value qo 三 (sin(1.3),0,cos(1.3))， 
Do = (3cos(1.3),6.5, 一 3sin(1.3)), step Size 中 = 王 0.01 
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Numerical Experiment，Consider the equations (1.3) for the spherical pendulum. 
For a mass 7 = 1 they coincide with the Hamiltonian formulation. Figure 1.2 
(upper Picture) Shows the numerical solution (vertical coordinate 9q3) over many 
periods obtained by method (1.19)-(1.20). We observe aregular qualitatively correct 
behaviour For the implicit Euler method (ie., the argument qn is replaced with qi1 
in (1.19)) the numerical solution，obtained with the Same Step Size and the Same 
injtial values, ls less satisfactory. Already after one period the solution deteriorates 
and the pendulum loses energy. 


VJIL.1.4 SHAKKE and RATITLE 


The numerical method (1.19)-(1.20) is only of order 1 and it is not Symmetric. An 
algorithm that is of order 2, Symmetric and Symplectic was originally considered for 
Separable Hamiltonians 


BO = 了 TMPp+U (1.22) 


With constant mass matrix AM. Notice that in this case we are concerned with a 
second order differential equation Mi = -UVv(9) - G(q7A with g(9) = 0. 


SHA 天 丘 ，Ryckaert Ciccotti 多 Berendsen (1977) propose the method 


dan+1 一 2dn 十 qn-1l 
0 


一 j2 1 一 1 (DZx(dn) 十 G(on)7 Xn) 


1.23 
9(dan+i1) 


for computations in molecular dynamics. It is a straightforward extension of the 
Stormer-VYerlet Scheme ([.1.15). The p-components, not used in the Tecursion, are 
approximated by pn = M(wnHi 一 o_D/27 


RATTLE. The three-term recursion (1.23) may lead to an accumulation of round- 
off errors, and a reformulation as a one-step method is desirable. Using the Same 
procedure as in (1.17) we formally get 


凡 
Dnp+1l/2 二 Drn 一 于 (Ze(dn) 二 G(on)7Xn) 


dn+l = 和 十 jMTIpnrliy2， 0 三 9(n+1) (1.24) 
风 
Dnp+l 二 Dn+T1/2 一 本 (De(dn+1) 可 G(qan+1)7 Am). 





The difficulty with this formulation is that An+l is not yet available at this step (it 
is computed together with on+2). As aremedy, Andersen (1983) Suggests Teplacing 
the last line in (1.24) with a projection Step Similar to (1.20) 


用 
DPn+1 三 Prn+1l/2 一 本 (De(dn+1) 5 G(dw HUan) 
0 = G(wTD)MT pn+l， 


(1.25) 
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This modiftication, called RATITLE, has the further advantage that the numerical ap- 
proximation (pn+l,dn+li) lies on the solution manifold A4. The symplecticity of 
this algorithm has been established by Leimkuhler 儿 Skeel (1994). 


了 xtension to General Hamiltonians. As observed independently by Jay (1994) 
and Reich (1993), the RATITLE algorithm can be extended to general Hamiltonians 
as follows: for consistent values (pn, an) E AM define 


几 


Dn+l/2 二 Drn 一 林 (Be(pn+1/2， dn) 必 G(an)7Xn) 








岂 
qdn+l 三 dmn 十 可 (BCpn+l/2， dm) 下 五 p(Pn+1/2， dn+1)) 


0 = (os 





几 
DPn+l 三 Dn+1/2 一 可 (Bo(pn+1/2,gn+1) 十 G(oH1i)7un) 


Con) 万 p(Dn+l， dn+1). 


The first three equations of (1.20) are very Similar to (1.19) and the last two equa- 
tions to (1.20). The existence of (locally) unique solutions (Pu+1 /2， gmn 二 1， Xn) and 
(pnr+l,Hn) can therefore be proved in the same way. Notice also that this method 
gives anumerical solution that stays exactly on the solution manifold A4. 


一 
| 


Theorem 1.3. 77Pe 1U1e7ical etpod (1.26) 18 8Sy111111etric，Sy11D1ectic，a12d co11vVe 六 
Se1t Of ojder 1W0. 


Prooj Although this theorem is the Special case 5 三 2 of Theorem 1.4, we outline 
its proof. We will see that the convergence result is easier to obtain for s = 2 than 
for the general case. 

If we add to (1.26) the consistency conditions 9g(qn) = 0, G(qu) 瓦 (pn dan) 一 
0 of the initial values, the Symmetry of the method follows at once by exchanging 
六 全 一 几 DOnp+1 过 Dny dgn+l 过 gdnandXn 过 1 Thesymplecticity can be proved 
as for (1.19)-(1.20) by computing the derivative of (pn+1l,qn+1) with respect to 
(pn, dan), and by verifying the condition (1.16). This does not seem to be Simpler 
than the Symplecticity proof of Theorem 1.4. 

The implicit function theorem applied to the two Subsystems of (1.26) Shows 


Dn+l1/2 三 Dn 十 OA，AA 王 ON)， pn+l 王 Dn+1/2 十 OA， ju 三 OU 
and, inserted into (1.26), yields 


dm 二 1 一 da(tn+1) 二 O(12)， mn 十 1 一 D(tn+1) 2 Gain) 二 O(). 


Convergence of order one follows therefore in the Same way as for method (1.19)- 
(1.20). Since the order of a symmetric method is always even, this implies converT- 
gence of order two. 














An easy way of obtaining high order methods for constrained Hamiltonian SyS- 
temas ls by composition (Reich 1996a). Method (1.20) is an ideal candidate as basic 
integrator for compositions of the form (V.3.2). The resulting integrators are SYym- 
metric, Symplectic, of high order, and yield a numerical solution that Stays on the 
manifold A4. 
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VII.1.S The Lobatto IHUIA - HIB Pair 


Another posSibility for obtaining high order Symplectic integrators for constrained 
Hamiltonian Systems ls by the use of partitioned Runge-Kutta or discontinuous col- 
location methods. We consider the system (1.9) and we search for polynomials 炎 ( 女 
of degree s, (人 (t of degree s 一 1,andvu(ti ofdegree s 一 2sSuchthat 


ut 如 ) 一 qn， u 人 (如 ) = Dn 一 六 D10( 刀 ) (1.27) 
with the defect 
5 人 一 2 的 十 丽 ( 人 (ou 二 G(eD) wo (1.28) 
and, using the abbreviation 如 ; 三 如 十 ci 用， 
芯 ( 如 一 奋 p(o 人 tu 人 ti)， 人 (1.29) 
站 和 = 一 丽人 (要 和 让 一 Gu ao 2 一 1 


0 = g(u( 如 让)， 一 1 5. 
于 these polynomials exist, the numerical solution is defitned by 


danH1 三 MU( 刀 十 癌 ，， pn 三 2( 刀 十 站 一 Pbs6( 加 十 力 人 
0 三 G(qn+1)Ep(pn+l,9gn+1). 


Why Discontinuous Collocation Based on Lobatto Quadrature? At a first 
glance (Theorem VI.4.2) it seems natural to conslider collocation methods based on 
Gaussian quadrature for the entire System. This, however has the disadvantage that 
the numerical solution does not satisfy g(qn+1) = 0. To achieve this requirementb， 
刀 十 尹 has to be one of the collocation points, ji.e., we must have cs 三 1. Unfortu- 
nately, none of the collocation or discontinuous collocation methods with cs。 = 1 is 
Symplectic (See Exercise IV.0). We therefore turn our attention to partitioned meth- 
ods, and we treat only the q-component by a collocation method satisfying cs。 三 1. 
To satisfy the s conditions g(u(ta)) 一 0 of (1.29) there are only s 一 1 free pa- 
Iameters (如 ), 人 ( 刀 十 cz 由 uU( 志 十 co-iP) available. A remedy is to choose 
cl 三 0 So that the first condition g(u(tn)) = 0 is automatically verified. EncouT- 
aged by TIheorem VI.4.3 we are thus led to consider the Lobatto nodes in the role 
of the ci. The use of the partitioned Lobatto IIUJA - IIUB pair for the treatment of 
constrained Hamiltonian Systems has been Suggested by Jay (1994, 1990). 


Existence and Uniqueness of the Numerical Solution。 The polynomial w(t of 
degree s is uniquely determined by v( 如 ) 三 qn and (如 ii) 三: Qi = 了 1. .59)， 
the polynomial v( 坟 of degree s 一 2 is uniquely determined by v(t 如 ii) 三 : 已 (三 
1,...,s5 一 1), and the polynomial w(b of degree s - 1 is uniquely determined 
by hao( 如 让 =: 4 三 1...),5). Notice that the value 4。is only involved ip 
(1.30) and not in (1.27)-(1.29). For the nonlinear System (1.27)-(1.29) we therefore 
Conslder 
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三 = (Go 及,P 144 1) 

as independent variables, and we write the System as 下 (X,]) = 0. The function 
忆 is composed of the s conditions for 飞 ( 刀 i, of the definition of v( 妃 ) (divided 
by Di) and the s -- 2 conditions for zf 如 zi) (multiplied by 几 ,， and finally of the 
s 一 1 equations 0 = g(u( 恕 让) for1 = 2,...,s (divided by 办. Observe that 
0=9g(u(t 如 )) is automatically satisfied by the consistency of (pn, gn). We note that 
已 =v( 刀 十 尹 ) and 户 = 1 人 (ti) are linear combinations of 已 ，..，, 忆 1 with 
coefficients independent of the step Size 几 . 

The function 玉 (X,]) is well-defined for 刀 in aneighbourhood of 0. For the first 
two blocks this ls evident, for the last one it follows from the identity 


1 和 

和 g(u 人 (ti)) = / G(u( 如 十 gj)) 习 (二 十 gm) db 
using the fact that 如 (tn 十 gj is a linear combination of @i fori = 1 ...,s. With 
the values 


0 二 (可 (pn dm)， 5 .万 p(pn an),Dn， 汪汪 ,Dn， 0， 到 演 .0) 


we have that 严 (Xo,0) = 0, because the values (pn,dqn) are assumed to be consis- 
tent. In view of an application of the implicit function theorem we compnute 


BF TQ@T 一 D@ 万 np 0 
5 (Co0) 二 0 BT 7TQ@GI |， (1.31) 
4@G 0 0 


Where 五 pp， G are evaluated at (pn, qn), and 4, 忆 ,Dare matrices of dimension 
(s 一 JJ)xs,(s 一 JJ)x(s 一 1)andsx(s 一 Jrespectively that depend only on the 
Lobatto quadrature and not on the differential equation. For example, the matrix 已 
Tepresents the linear mapping 


( 书 ， 二 二 有 5 | 一 ( 恶 十 困 世 , 户 ， 局 , 忆 _1). 


This mapping is invertible, because the values on the right-hand side uniquely de- 
termine the polynomial v(t of degree s 一 2. 
Block Gaussian elimination then Shows that (1.31) is invertible 过 and only 寺 the 
ImatriX 
4DB-IQ@GBpG7 isinvertible. 


Because of (1.13) it remains to show that 4DB-1Lis invertible. 

To achieve this without explicitly computing the matrices 4 ,已 ,D, weapply the 
method to the problem where p and d are of dimension one, 刀 (pz,q) = 22/2, and 
0(d) = qd. Assuming 忠 王 1 we get 

u(0) = 0， Vu(0) = 一己 (2(0) 十 w(0)) 
zci) 一 V(ci for 一] 十...，,5S 

(ci) = (ci) 4132) 
of(ci) = 一 w(ci) for 一 2),... ,5 一 1 


0=Vw(ci) for 1 一 工 ...，,5， 
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which is equivalent to 


ci)) 


T -D 0 (&(ci))2 1 0 
0 瑟 了 (fc)) | =|0|， (1.33) 
和 


because 万 pp(D,9) = 1 and G(q) = 1. Since v( 轨 ia polynomial of degree s， 
the last equation of (1.32) implies that (三 CTT-L 一 cj). By the second 
relation the polynomial (为 一 v(, which is of degree s 一 1, vanishes at s points. 
Hence, uv 人 (tb) 三 飞 ( 办 , which is possible only 证 C = 0, because the degree of uv 人 (区 is 
s 一 2. Consequently, the linear System (1.33) has only the trivial solution, So that the 
matrix in (1.33) and hence also 4DB-Lis invertible. 

The implicit function theorem applied to 严 (X, 几 ) = 0 shows that the nonlinear 
System (1.27)-(1.30) possesses a locally unique Solution for sufficiently Small step 
Sizes 几 . Using the free parameter 4。 = ju( 刀 十 几 ), afurther application of the 
implicit function theorem, this time to the Small System (1.30), proves the existence 
and local uniqueness of pn 二 1. 


Theorem 1.4. Ze (bi ci)2 1 pe 1pe Weigjp1s ad 11odes of 1he ZLopatto quad1atu1e 
(cf (177 Te aetpod (1.27)-(01.29)-(71.30) 18 8y1a111etric, Sy111D1ectic, Ga11d SUPer- 
CoO1Ve1gSe1t of order 25 一 2. 


Proof 9y11az111ety To the formulas (1.27)-(1.29)-(1.30) we add the consistency re- 
lations 9g(qn) = 0,，G(qn) 古 (pn dan) = 0. Then we exchange (如 ,Dnygn) 候 
( 妃 +1Dnldnri)and 六 全 一 .Sinceb 一 bsandcsli 王 1 一 cifortheLobatto 
quadrature, the resulting formulas are edquivalent to the original method (see also the 
proof of Theorem V.2.1). 

7 


Sy111DlecticiDy， We fix 上, 和 E To 人 weput Zn 三 (pn,dn) ,and we consider 


the bilinear mapping 


新 。 韦 | 下 S( (2 (3 区 十) (2) (区 ))e 











Dz， ”Drn DOzn Dzn Dzn DOzn 


The symplecticity of the transformation (pn, an) 上 (pn+l,dn+l) on the manifold 
人 4 is then expressed by the relation 


(3 光 -9( 绾 ,到 ) 0 


We now follow closely the proof of Theorem IV.2.3. We consider the polyno- 
mials ut,u(t,uw 人 的 of the method (1.27)-(1.29)-(1.30) as functions of 上 and 
2mn 一 (pn,dn),and we compnute 











人 


?tdQ@y7rau 人 au 介 
加 人 dt 人 Dz ”bzn ) 必 


(1.35) 
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Since v(b is a polynomial of degree s and v(t of degree s 一 2, the integrand in 
(1.3S) is apolynomial in tof degree 2s 一 3.Itis thus integrated without error by the 
Lobatto quadrature. By deftinition these polynomials satisfy the differential equation 
at the interior collocation points. Therefore, it follows from (1.17) that 


dQ 人 : | 二 





for 三 2,...,5 一 |， 








化 Ozn ， On 
and that 
QQ Doti) Du 人 ti 1600( 加 让 Du 人 (ta 于 加 
直人 ( OZn Dzn 总 红 DZ ) Dz ) for 72 王 1 and 1 一 5. 


Applying the Lobatto quadrature to the integral in (1.33) thus yields 


D6( 妇 ) Du(t) D6( 如 +1) | 


有 





and the Symplecticity relation (1.34) follows in the Same way as in the proof of 
Theorem IV.2.3. 


UpDe1comyve1ge1ce. This is the most difficult part of the proof. We remark that Super- 
convergence of Runge-Kutta methods for differential-algebraic Systems of index 3 
has been conjectured by Hairer, Lubich 儿 Roche (1989), and a first proof has been 
obtained by Jay (1993) for collocation methods. In his thesis Jay (1994) proves SU- 
perconvergence for a more general class of methods, including the Lobatto JIUA - 
JUB pair using a “rooted-tree-type”theory. A Sketch of that very elaborate proof 
is published in Jay (1996). Using the idea of discontinuous collocation, the elegant 
proof for collocation methods can now be extended to cover the Lobatto IJIA - IJB 
pair. In the following we explain how the local error can be estimated. 

We consider the polynomials (tv 人 defined in (1.27)-(1.29)-(1.30)， 
and we define defects (人 轨 ,6( 坟 ,0 as follows: 


尼 提 = 丽人 (的 ) 十 A 
-于 (ou 的 一 Gu w+5G (1.36) 
0 = 9g(uv 人 的 ) 二 9 


By deftinition of the method we have 


避 . 
es 
c 
SR 


克 枯 于 机 册 二 10， 
0( 刀 斩 ci 几 ) 一 0， 全 225 一 下 〈1.37) 
0( 刀 十 cih) 一 0， 2 | .5. 





We let q( 坟 ,D( 坟 ,入 (区 be the exact solution of (1.9) satisfying dg( 妃 ) = qn, D( 如 ) 三 
2Dn, and we consider the differences 


Au0=ua0-o0，4o0=og-PD，4Ao( = -AD 
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Subtracting (1.9) from (1.36) we get by linearization that 


Lu = ai(Auv+az 人 (Au+AO 


(1.38) 
Au 一 a2l( 切 Au 本 Q22 ( 坟 Av 生 a23( 臣 Au 十 0(b)， 








where alz( 塌 = 如 op(p( 的 ,9 的 ),，and where the other aij( 雪 are given by similar 
expressionSs. We have Suppressed quadratic and higher order terms to keep the pre- 
Sentation as Simple as posSsible. They do not influence the convergence result. To 
eliminate Aw in (1.38), we differentiate the algebraic relations in (1.9) and (1.36) 
twice, and we Subtract them. This yields 


0 = 下 (人 的 ) 十 太 扩 Au 十 妇 ( 人 Ao 十 巨人 Auw 
+ GOb)Es 人 ob)5 的 上 GO)AGbD 二 00b， 


Where 焉 ( 志 信 , 肆 坟 ,Di( 坟 ,ba 人 (区 are functions depending on p(,， qd 人 ,入 (， 人， 
u( 人 bu 人 ,and where 大人 0) =0and 巨 四 福 Go)Bp(pnqn)G(an)7. Because 
of our assumption (1.13) we can extract Aw from this relation, and we insert it into 
(1.38). Pn this way we get a linear differential equation for Avw, Av, which can be 
Solved by the“variation of constants”formula. Using Au( 刀 ) = 0 (by (1.27)), the 
solution Av( 刀 十 jis seen to be of the form 


tm 十 及 
Au 人 起 十 站 = Ra 人 起 十 彤 起 )Ao(ta) 十 (Rio 十 记 buG 
志 


FF 帮 , 有 (5 人 (十 六 (tb) 十 cl ( 昌 产 ( (1.39) 


+ CUD(G(wD)Ee(e(D,u(D)5(D 上 59) ) ) qt 





十 Raa( 如 - 





where Ri and 及 22 are the lower blocks of the resolvent, and 五 ,cl)C are functions 
as before. To prove that the local error of the p-component 


Dr 一 D( 刀 十 站 三 Ao( 刀 十 尹 一 Ps0( 如 十 用 (1.40) 


is of size O(P2s-I)，we first integrate by parts those expressions in (1.39) which 
contain a derivative. For example， 


忒 十 1 





-angw=o0e) 


也 


太 won0w=agug 


7 


because HU 人 (如 ) = WA( 刀 十 几 ) = 0by(1.37) and an application ofthe Lobatto quadra- 
ture to the integral at the right-hand side gives zero as Tesult with a quadrature error 
of size O(12s-1). Similarly, integrating by parts twice yields 


人 / 0 





tm 世 m 


a(tn+H1)g(tnt1) 全 a( 如 )g(t) 十 OU 


也 
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To the other integrals in (1.39) we apply the Lobatto quadrature directly. Since 
忆 2a(tnltn+i)is the identity, this gives 


Da pt 一 Raz 人 tn 如)( Au 人 tn) 环 hox5ttn)) (1.40) 
+ Cos)(AbsG(uttsrD))Ep(ottn)ultnta)a(tnt)+6GtntD) 


+ Go)(aonG(uett))Boto)att))5t)- 6)+OU2-9)， 








where CU = RDCO. The term Au( 如 ) 十 1016( 如 ) vanishes by (1.27)， 
and differentiation of the algebraic relation in (1.36) yields 


0 = G(u 食 ) ( 王 (e(D,w0D) 十 AD) 十 0()， 


As aconsequence of (1.27), (1.37) and the consistency of the initial values (pn dm)， 
this gives 
b(t) 二 人 径 CG(qn)Dp (pn 一 PD10(trn)， dn】 
= hiG(oan)Eop(pn an)5(t) 二 O(25(to) 
PoiG(u 人 ta) 加 ui))5()+O(26(t). 


Using (1.30) we get in the Same Way 
bn) ES PhDsG(u 人 ta))Eop(u(tn+ih， ua 人 (t+i))(nsi) 夫 O(125(tnr). 


These estimates together show that the local error (1.41) is of size O(P2s-1) 十 
O(125( 人 2). The defect 5() vanishes at s 一 2 points in the interval [如 ,ti], so 
that 6( 塌 = O(1s 2) fort e [如 如 +] (for arigorous proof of this statement one 
has to apply the techniques of the proof of Theorem JI.1.3). Therefore we obtain 
pn 一 pt =O(h2s 2),andbythesymmetry ofthe method also O(P2s 一 1)， 

In analogy to (1.39)，the variation of constants formula yields also an ex- 
Pression for the local error qn+1 一 q(t+1) = Au(t 恕 +). One only has to re- 
place Ra2l and R2 with the upper blocks Ril and Ri of the resolvent. Using 
Ria(tn+ltn+l) = 0, we prove in the same way that the local error of the q- 
component is of size O(jP2s 一 1). 

The estimation of the global error is obtained in the Same way as for the first 
order method (1.19)-(1.20). Since the algorithm is a mapping 2 : Ad 一 人 on the 
solution manifold, it is not necesSary to follow the technically difficult proofs in the 
context of differential-algebraic equations. Summing up the propagated local errors 
proves that the global error satisfies pn 一 D( 妃 ) = O(22s 2) and an 一 qd( 妃 ) 一 
@O(j2s-2) as long as 如 = 7 刀 芭 Const 














VIL1.6 Splittings Methods 


When considering splitting methods for constrained mechanical Systems, it Should 
be borne in mind that Such Systems are differential equations on manifolds (See 
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Sect.VIL.1.2)，Splitting methods Should therefore be based on a _ decomposition 
jy) = jj) + FDI(W) where both j 国 (wy) are vector fields on the same man- 
ifold as Foy). Let us consider here the Hamiltonian system (1.9) with Hamiltonian 


万 ( 亿 及 二 互 站 扣 人 厅 洒 再 后 0 区 (1.42) 
The manifold for this differential equation 1s 
AM={Dg)190) =0 G(OB(9) =0} (1.43) 


Notice that (1.9), when 万 is simply replaced with 五 回 , is not a good candidate for 
Splitting methods: the existence of a solution is not guaranteed, and if the solution 
exists it need not Stay on the manifold A4. The following lemma indicates how 
splitting methods should be applied. 


Lemma 1.S. Comsider a amailtomnian (7.42) afpnction g(9q) wii G(q) = 9 (9q)， 
G11d Lef 1he 111a1z1fo1d A4 pe give1l py (7.43) 达 (7.73) 1olds amd 洲 


G(OEMgJ =0 Joral (ge Ah (44) 
加 e1l 加 e yyste11 
区 二 
六 = 一 Bio) 一 G(O7A (1.45) 


0 = GOEn(D9) 


dejijzeysaadijjerential eguatio1l oa 1je 1ia111l AL and is Jomwiasy1ipiectic 1ra11y- 
oratiom om 人 4. 


Prooj Differentiation of the algebraic relation in (1.43) with respect to time, and 
replacing qd and DP with their differential equations,，yields an explicit relation for 
入 一 X(D,d) (as a consequence of (1.13)). Hence, a unique solution of (1.45) exists 
locally 计 G(qo) 羽 (po, go) = 0. The assumption (1.44) implies 艺 g(a() = 0.This 
together with the algebraic relation of (1.45) guarantees that for (po,qo) E A the 
Solution Stays on the manifold A4. The symplecticity of the fow is proved as for 
Theorem 1.2. 














Suppose now that the Hamiltonian 碌 (2z,dq) of (1.9) can be split as in (1.42)， 


where both 五 回 (p, g) satisfy (1.44). We denote by 站 the fow ofthe system (1.45). 


下 these fows can be computed analytically, the Lie-Trotter splitting o 问 O oh and 


the Strang Splitting 蜀 O 2 问 O 六 yield first and second order numerical inte- 
grators, fesSpectively. Considering more general compositions as in (1.3.6) and using 
the coefficients proposed in Sect. V.3, methods of high order are obtained. They give 
numerical approximations lying on the manifold 人 4, and they are Symplectic (also 


Symmetric 过 the splitting is well chosen). 
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For the important Special case where 


瓦 (D,9) = 工 (D,9q) 十 U(9) 


is the sum of the kinetic and potential energles, both Summands satisfy assumption 
(1.44). This gives anatural Splitting that ls often used in Practice. 


了 xample 1.6 (Spherical Pendulum). We normalize all constants to 1 (cf. Exam- 
ple 1.1) and we consider the problem (1.9) with 


( 字 十 吧 十 邓 一 1， 


MP 1 博 


1 
豆 (p,g)= 了 1 有 到 二 四 ) 十， 9(9) = 


We split the Hamiltonian as 瓦 H(p, 9) 一 下 (2 十 友 十 妇 ) and 万 P(p, dg) = ga， 
and we Solve (1.45) with initial values on the manifold 


人 = fg 1 呈 十 吧 十 归 一 1 =0, pigl 二 podz 十 psds 一 0)}. 
The kinetic energy 刀 吕 (p, g) leads to the system 


4=Dp， 05=-4A， dp=0， 
which gives 入 = D1D0， so that the flow o is just a planar rotation around the 
origin. The potential energy 万 站 (p, g) leads to 


=0， = 一 (001D)7 一 MX， dp=0. 
[2] 


The fow fkeeps d(t constant and changes p(b) linearly with time. Splitting 
methods give Simple, explicit and Symplectic time integrators for this problem. 


VIIL2 Poisson Systems 


This section ls devoted to an interesting generalization of Hamiltonian Systems， 
where .三 1 in (VIL2.5) is replaced with a nonconstant matrix 已 (y). Such struc- 
tures were introduced by Sophus Lie (1888) and are today called Poissom sysSte1118. 
They result in particular, from Hamiltonian Systems on manifolds written in non- 
canonical coordinates. In a first Subsection, however, we discuss the Polisson struc- 
ture of Hamiltonian Systems in canonical form. 


VIIL2.1 Canonical Poisson Structure 


... quelques remarques Sur la plus profonde decouverte de M. Poisson， 
mais qui, je crois, na pas ktE bien comprise ni par Lagrange, ni par les 
nombreux gometres quil ont citke, ni par Son auteur lui-meme. 

(C.G.J. Jacobi 1840, p.350) 
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The derivative of a function 玉 (p, q) along the fow of a Hamiltonian system 


is given by (Lie derivative, see (II.5.3)) 
dd d 
Q OF OF OF Oo 0P0 万 


2 上 





This remarkably symmetric structure motivates the following deftinition. 


Definition 2.1. The (canonical) Poissom pracket of two Smooth functions 严 (P,9) 
and G(D,dq) is the function 


QI 


OFOG 0F0G 
0 沪 人 0pi 9m 现 3 





or in vector notation { 忆 GO = YE) 7IVG(, wherey = (pq) and is 
the matrix of (VI.2.3). 


This Poisson bracket is bilinear, skew-Ssymmetric ({, G} = 一 {G, 忆 ,itsatis- 
fies the Jacopi iaem1ity (Jacobli 1862, Werke 3, p.40) 


{{ 甩 G 万 +{T(G, 本 +{T 五 有,G}=0 (2.4) 
(notice the cyclic permutations among 天, C,), and Zeipnzz rule 
{ 下 .G 万 } = 天.{G 万 十 G [五 万 上 (2.5) 


These formulas are obtained in a straighttforward manner from standard rules of 
calculus (See also Exercise 1). 
With thlis notation, the Lie derivative (2.2) becomes 


Q 


天 忆 G(G) = { 如 J(y 人 的 ) C.0) 


It follows that a function 7(p, 9) is a first integral of (2.1) 让 and only 让 
{7 万 } = 0. 


If we take 严 (V) = Wi, the mapping that selects the ;th component of y，we see that 
the Hamiltonian system (2.1) or (VI.2.59),7 = .JIV 万 (yy), can be written as 
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了 Poisson's Discovery。 At the beginning of 
the 19th century, the hope of being able to 
integrate a given System of differential equa- 
tions by analytic formulas faded more and 
more, and the energy of researchers went to 
the construction of, at least, first integrals. In 
thlis enthusiasm，Jacobi declared the Subse- 
quent result to be “Poisson s deepest discov- 
ery”(See citation) and his own identity, de- 
veloped for its proof, a “gravissimum Theo- 
rema . 


Theorem 2.2 (Poisson 1809). 矿 站 and 7 
G1e jjrst Iteg1aly, 1 太 e1l 加 eir Poissom DP1acket 
{ 辣 ,Zagaz astintegral. 





Proof This follows at once from the Jacopi 
identity with 五 王石 andC = 三 . 


Sim&on Denis Poissonl 














VIIL.2.2 General Poisson Structures 


... the general concept of a Poisson manifold should be credited to So- 
phus Lie in his treatise on transformation groups . . . 
(J.E. Marsden 广 工 S. Ratiu 1999) 


We now come to the announced generalization of Definition 2.1 of the canoni- 
cal Poisson bracket，invented by Lie (1888). Indeed, many proofs of properties 
of Hamiltonian Systems rely uniquely on the bilinearity, the SKkew-Symmetry and 
the Jacobi identity of the Poisson bracket，but not on the Special Structure of 
(2.3). So the idea ls, more generally, to start with a Smooth matrix-valued function 
B(y) = (bi( 作 ) and to set 


(Relg= 六 二 风 oug 


0CG(y) 
OV5 





(2.8) 
5 一 1 
(or more compactly { 已 GO 三 VE BODYG()， 


Lemma 2.3. 77Pe Drackel dejizzed 0 (2.8) 1 Diiea1 SKew-sy11z111etric Gd110 Satisjiey 
ZeID1iZ 11e (2.5) 08 Well as 1pe Jacopi ide1tity (2. 和 矿 a1ad oly 矿 








05 三 一 oa Joralli (2.9) 
G11G Jor a1L 2 7 (aotice 加 e cyclic Permattatio118 G11O118 77 天) 
wb 人 OK(y) Obki(y) 
b | bi | 司 2.1 
2 二 0 10 


1 Simkon Denis Poisson, born: 21 June 1781 in Pithiviers (France), died: 25 April 1840 in 
Sceaux (near Paris). 
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Prooj The main observation is that condition (2.10) is the Jacobi identity for the 
Special choice of functions 已 三 Vi G = 三 2 瓦 三 Wr because of 








{0 9 = 7(). (2.11) 


H 开 equation (2.4) ls developed for the bracket (2.8),，one obtains terms containing 
Second order partial derivatives - these cancel due to the Symmetry of the Jacobi 
identity - and terms containing first order partial derivatives; for the latter we may 
assume PC, 囊 to be linear combinations of Vi，Vj， VSO we are back to (2.10). 
The details of this proof are left as an exercise (See Exercise ]1). 














Definition 2.4. If the matrix 妃 (y) satisfies the properties of Lemma 2.3, formula 
(2.8) is Said to represent a (general) Poisso7 pjacket. The corresponding differential 
System 

二 了 OO)Y 五 (y)， (2.12) 


is a _Poissom syste11. We continue to call 互 a Hamiltonian. 


The System (2.12) can again be written in the bracket formulation (2.7). The 
formula (2.0) for the Lie derivative remains also valid, as is seen immediately from 
the chain rule and the definition of the Poisson bracket. Choosing 下 = 也 ,this 
Shows in particular that the Hamiltonian 囊 ls a first integral for general Poisson 
Systems. 


Definition 2.$S.A function C(V) is called a Caszzzzz7 .jurmction of the Poisson System 
(2.12), 让 
VC()7B() =0 for all y/， 


A Casimir function is a first integral of every Poisson System with Structure 
matrix 妃 (y), whatever the Hamiltonian 互 (y) is. 


了 xample 2.6， The Zooa-Wjierra equations of Sect.I.1.1 can be written as 


() 一 企 9 立 末 (ww 由， (2.13) 


Where 克 (v) 三 怀 一 In 二 vv 一 2Inoisthe invariant ([.1.4). This is of the form 
(2.12) with a matrix that is Skew-Symmetric and Satisfies the identity (2.10). 

Higher dimensional Lotka-Volterra Systems can also have a Poisson Structure 
(See, e.g., Perelomov (1995) and Suris (1999)). For example, the System 


丰 三 Jp 十 9)， 帮 = 一 十)， 镀 三 四 ( 匀 十 巡 十 了 
can be wiitten as 


多 0 V1y2 193 
加 |=|-n 0 -加 | YVJ) (2.14) 
73 一 V173 。 V273 0 
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with 互 (W) = -人 册 十 加 十 办 十 mo 一 mys. Again one can check by direct com- 
putation that (2.10) is Satisfied. 

In contrast to the structure matrix .j-! of Hamiltonian systems in canonical 
form, the matrix 妃 (y) of (2.12) need not be invertible. All odd-dimensional skew- 
Symmetric matrices are singular, and so is the matrix 妃 (y) of (2.14). In this case， 
the vector v(V) = (一 1/0, 一 Lo 1/ya)7 satisfies v( 力 7 B(V) = 0. Since v(y) = 
VC with CU) = 一 mw-imny 二 mya,the function CUy) is aCasimir function. 


VIL2.3 Hamiltonian Systems on Symplectic Submanifoldqs 


An important motivation for Studying Poisson Systems is given by Hamiltonian 
problems expressed in non-canonlical coordinates. 


了 上 xample 2.7 (Constrained Mechanical Systems). Consider the System (1.9) 
written as the differential equation 


忆 一 OO +》 Xiz] VD) (2.15) 


?一 工 


on the manifold AM = {zi;c(z) = 0} with c(z) = (9(0)G(O) 瓦 (p， g 办 ) and 
Z 一 (D,d)7 (see (1.14)). As in the proof of Theorem 1.2, Xi(zZ) and gi(z) are the 
components of A(Z) and g(z), and 和 A(Z) is the function obtained from (1.12). We 
use WE 了 月 244-mm) as local coordinates of the manifold A4 via the transformation 


Z 三 X(y). 


In these coordinates, the differential equation (2.15) becomes, with X(V) = X(y)， 


XO)8 = (YE(CXO)+y AN(xO)vo(x())) 


并 


We multiply this equation from the left with X(z)L .7 and note that the columns of 
X( 力 ，which are tangent vectors, are orthogonal to the gradients V9i of the con- 
Straints. This yields 


X()7 ITX(O)D = X(O)7Y 厅 (X(y)， 


By assumption (1.13) the matrix X(y)7 IJX( is invertible. This is seen as follows: 
X(7 JTJXOu = 0implies JX()u = c(z)7owo for some w (2 = X( 力 ). By 
c(X( 人 ) = 0andc(z)X(O) = 0 we getc(z)J Ice(z)7uw = 0.Itthen follows 
from the structure of c (z) and from (1.13) that w = 0 and hence alsov = 0. 

With B(V) = 三 (KX()TITX(J)) and 开 (y) = 五 (X(W)) , the above equation 
fory thus becomes the Poisson System = 已 (V)V 天 (四 .The matrix 妃 (V) is skew- 
Symmetric and satisfies (2.10), see Theorem 2.8 below Or Exercise 11. 
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More generally, consider a symzpleciic spaa1aif1d .NM of 陈 2d, thatis,amanifold 
for which the symplectic two-form” 


wz(6lc) = 一 (JS) for 和 ETA (2.160) 


(with (.,,) denoting the Euclidean inner product on 月 29) is mom-degemerate for every 
ZEAf:forsciinthetangent space 7Tz 人 1， 


wz 上 cz)=0 foral es7A implies 6 和 =0. 


In local coordinates Z = X(y), this condition is equivalent to the invertibility of 
the matrix XU ITX() with X(V) = X( 力 ,since every tangent vector at Z 一 
X(V) is of the form e = X()7 and X(Vy) has linearly independent columns. A 
manifold defined by constraints, AM = {z E 耻 24|c(z) = 0}, is symplectic ifthe 
matrix c (ZJ 一 lc(z)7 is invertible for every ZE 人 (see the argument at the end 
of the previous example). This condition can be restated as Saying that the matrix 
({ci cj}(z)) of canonical Poisson brackets of the constraint functions is invertible. 

We consider the reduction of the Hamiltonian System to the Symplectic Subman- 
ifold 人 4, which determines solution curvest mm zt E 人 by the equations 


(之 -VE(z)6=0 foral ee 人 人 4. (2.17) 


With the interpretation (V 万 (z),6) = 五"((Z)6 = 芭 |，0 五 07 的) as a directional 
derivative along a path 7 人 E At with yY(0) = Z and (0) = 和 ,itis sufficient 
that the Hamiltonian 豆 is deftined and differentiable on the manifold 人 4. Equation 
(2.17) can also be expressed as 


ua(2 刁 = 再 (z)E forall Eee 到 AM (2.18) 


a formulation that is Susceptible to further generalization; cf. Marsden 贸 Ratiu 
(1999)，Chap.S.4, and Exercise 2. Choosing 上 = 之 we obtain 0 = 万 '(z)z = 
芋 刀 (z( 鸭 ), and hence the Hamiltonian is conserved along solutions. 

Note that for A4 of Example 2.7, the formulation (2.17) is equivalent to the 
equations of motion (2.15) of the constrained mechanical System. It corresponds to 
d 4iemperts Principie of virtual variatio1sy in constrained mechanics; see Arnold 
(1989), p. 92. In quantum mechanics the 了 amiltonian reduction (2.17) to a mani- 
fold (in that case, a submanifold of the Hilbert space 7 ( 尺 ,及 2?) instead of 了 及 29d) 
is known as the Dirac-Pekel Le-depe1de1t vI1iatio11a1 Prizcipie and ls the ba- 
Sic tool for deriving reduced models of the many-body Schrodinger equation; See 
Sect.VIIL6 for an example. From a numerical analysis viewpoint, (2.17) can also be 
viewed as a Galerkin method on the solution-dependent tangent Space 了 Tz 人 4. 

In terms of the symzplectic projecfion P(z) : 及 24 一 人 AM for z e 人 defined 
by determining P(z)v e 人 AM forv e 及 24 from the condition 


(JP(z)u6) = (Jo 6) foral 和 Se 六 人 A， (2.19) 


2 Notice that this two-form is the negative of that introduced in Sect. VIL2. This slight in- 
consistency makes the Subsequent formulas nicer. 
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formula (2.17) can be reformulated as the differential equation on 人 4， 
= P(z)JIV 厅 (z). (2.20) 


In coordinates z = X(y), and again with X(V) = X(y), formula (2.17) becomes 


X(OO7(IX(O08- YECX(GD)) = 


and with 


一 工 


BO) = (XIXO)) and 天 (9) = 万 (X(y)， (2.21) 


we obtain the differential equation 
汪 有)Y 玫 (， (2.22) 


Theorem 2.8. For a Pamailiomia1z 8yste1a (2.17) o11 G Sy1Dlectic SUD1IG11Ld 人 1 
1je egUiVale1t dereatial eqguation 胡 Vocal coo1di1zateys， (2.22) mi (2.21) 1 4 
Poisso7m 8yS1e111. 


Proof Im coordinates, the Symplectic projection ls given by 
P(z)=XU)BO)XGO)7 IT for z=XO) < 人， 
Since for every tangent vector E = X(y)7 we have by (2.21)， 
(JXBXT.Juo,Xn) = (XIT.JXBX7Jo7)=(X7TJom) = (Jo X7). 


From the decomposition 阴 24 = P(z) 了 246(7T 一 P(z)) 阴 24 we obtain, by theimplicit 
function theorem, a _ corresponding splitting in a neighbourhood of the manifold A4 
in 及 24， 

u 一 Z 十 WU ， with ZEA 人 APz)w=0. 
This permits us to extend smooth functions 严 (y) to a neighbourhood of A4 by 
Setting 


天 (w) 一 下 forv=2Z 二 ww with zz 三 X(，P(z)w = 0. 


We then have for the derivative 到 (oz 三 义 (Z)P(z) for z E At and hence for its 
transpose, the gradient VE(z) = P(zZ)7VF(z). Moreoven by the chain rule we 
have VON) = XU) VF(z) for z = X(y). For the canonical bracket this gives， 
at2Z 一 X(y)， 


{ 开 ,Gana(z) = VYF(zT7P(z)J-LP(z)TVGC(z) 
= VEFU)7BUVYGUW) = { 玉 GO)， 


and hence the required properties of the bracket defined by 肆 (y) follow from the 
corresponding properties of the canonical bracket. 
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VIL3 The Darboux-Lie Theorem 


Theorem 2.8 also Shows that a Hamiltonian System without constraints becomes a 
Poisson System in non-canonical coordinates. Interestingly, a converse also holds: 
every Poisson System can locally be written in canonical Hamiltonian form after 
a Suitable change of coordinates. This result is a Special case of the Darpoxx 一 Lie 
7jpeore11. Its proof was the result of Several important papers: Jacobi's theory of 
Simultaneous linear partial differential equations (Jacobi 1862), the works by Cleb- 
Sch (1866) and Darboux (1882) on Pfaffian Systems, and, finally, the paper of Lie 
(1888). We shall now retrace this development. Our first tool is a result on the com- 
mnutativity of Poisson flows. 


VJI.3.1 Commnutativity of Poisson Flows and Lie Brackets 


The elegant formula (2.60) for the Lie derivative ls valid for general Polisson Systems 
with the vector field /yy) = 已 (y)V 瓦 (V) of (2.12). Acting on afunction 已 : 月 ”一 
及 , the Lie operator (II.3.2) becomes 


站 巴 =VFIA=VFIPBOJVY 末 ={ 忆 如 (3.1) 


and is again the Poisson bracket. This observation ls the key for the following 
lemma，wphich Shows an interesting connection between the Lie bracket and the 
Poisson bracket. 


Lemma 3.1. ZLet fo sootP 有 azmailtonians 瓦 由 (wy) and 下 (wy) pe givenm. 
不 忆 1 1 1e Lie operator of (J)VHul 
CI11G D2> 1Pe Lie operalor oF 巨 (J)VP， (3.2) 
ipen [Di, Do] ithe Lie operator of B( 人 V{ 五 站 厅 D} 
(11otice, O11Ce da8a11 1at tje idicey 1 a1d 2 jayve pee1 7eyveryedh. 
Prooj After some clever permutations, the Jacobi identity (2.4) can be written as 
{ 人 忆 万 站 一 人 玉石 0, 瑟 人 = 有 1ED ED， 6G3.3) 
By (3.1) this is nothing other than Di1Do 忆 一 DoDIP = [Di Do] 卫 . 














Lemma 3.2. Comsider no smooth Familonians 五 H(y) and 下 (yy) om an open 
CO11lected Sef WII 太 Di da1d Da 1je correspo1ndi118 ZLie ope1ators G11d 2 驯 (2) and 


2 加 (V) 1pe correspo1dijzg jiows. 7Tjpe1 trtpe atrrx 已 Tinvertiple tpejoiowing 


G1e equivalent 1 7 
(人 { 瑟 可 ,万 四) = Const; 
( 荔 [Di Da] = 0; 
人 


7T7je comclzsio1g“( 人 从 僵 ( 刘 兮 ( 间 ”also pold Jora 1on-iavertipie By). 
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Prooj This is obtained by combining Lemma IIU.5.4 and Lemma 3.1. We need 
the invertibility of 妃 (y) to conclude that { 万 中, 瑟 P} = Const follows from 
BO)V{ED, 厂 B} = 0 














VIL.3.2 Simultaneous Linear Partial Differential Equations 


Itwo functions 严 (y) and G(V) are given, formula (2.8) determines a function 
1(V) = {EG by differentiation. We now ask the inverse question: Given func- 
tions G(V) and AP(V)，can we find a function 己 (y) such that { 忆 GT 三 PCO) 2 
This problem represents a first order linear partial differential equation for 天 .So We 
are led to the following problem, which we first discuss in two dimenslions. 


One Equation. Given functions a(y,y2)，p( yo), 几 (Vi yo)， fnd all solutions 
三 (Viy2) satisfying 


OF 已 OP 
Qa(V1 ya) 5 十 by2) 二 及 (1 V2). (3.4) 


This equation is, for any point (yi, yz), a linear relation between the partial deriv- 
atives of 严 ，but does not determine them individually. There ls ozme direction， 
however， where the derivative ls uniquely determined, namely that of the vector 
风 一 (a(y pp) (72)), since the leftrhand side of equation (3.4) is the direc- 
tional derivative 多 ， The lines, which everywhere respect this direction, are called 
CHpa1acte1istic Lijies (See left picture of Fig.3.1). 开 we parametrize them with a para- 
meter we can compute Wi(b,ya(b as wellas 开 ( = 下 (人 , 鸭 ) as solutions 


of the following ordinary differential equations 


册 三 Qi)， 妇 三 以)， 玉 = 六 02)， (3.5) 


The initial values (1(0),y2(0)) can be chosen on an arbitrary curve yY (which must 
be transversal to the characteristic lines) and the values 天 |> can be arbitrarily pre- 
Scribed. The solution 严 (,y2) of (3.4) is then created by the curves (3.3) wherever 
the characteristic lines go (Tight picture of Fig. 3.1). 





Fig. 3.1. Characteristic lines and solution of a first order linear partial differential equation 
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Fig. 3.2. Characteristic Surfaces of two first order linear partial differential equations 


For one equation in m dimensions, the initial values (1(0),. ,gr(0)) can be 
freely chosen on amanifold of dimension 即 一 1 (e.g.,the Subspace orthogonal to the 
characteristic line passing through a given point), and E can be arbitrarily prescribed 
on this manifold. This guarantees the existence of 刀 一 1 independent Solutions in 
the nelighbourhood of a given point. Here, independent means that the gradients of 
these functions are linearly independent. 


Two Simultaneous Equations. Two Simultaneous equations of dimension two are 
trivial. We therefore suppose y = (Wi,y2,ya) and two equations of the form 


人 











乓 贡 十 吃 ( 的 有 十 败 ( 乓 有 一 各 ( 
(3.0) 
OP 书 OP 书 OP 书 
ao 人) 5 十) 5 二 = 知人 


Ovs 


for an unknown function 已 (yy2,y3). This system can also be written as 1 玉 = 


有 1 ， Do = 1 where Di denotes the Lie operator corresponding to the vector field 
a 园 (四 . Here, we have 1hwo directional derivatives prescribed, namely and 
where mi 一 a 四 (y) (see Fig.3.2). Therefore, we will have to follow both directions 


and, instead of (3.3), we will have 1po sets of ordinary differential equations 





太一 ao 人， 交 = 呈 W， 为 = 人 户 = 广 人 ( 急 


(3.7) 
六 =op(W)， 加 =oW)， 六 =o()， 忆 = 和 (人 急 ， 


开 we prescribe 忆 on a curve that ls orthogonal to ml and m2, and 过 we follow the 
solutions of (3.7), we obtain the function 已 on two 2-dimensional surfaces 91 and 
2 containing the prescribed curve. Continuing from .91 along the second fow and 
from 92 along the first fow, we may be led to the Same point, but nothing guarantees 
that the obtained values for 玉 are identical. To get a well-defined 天 additional 
asSumptions on the differential operators and on the inhomogeneities have to be 
made. 

The following theorem，which is due to Jacobi (1862)，has been extended 
by Clebsch (1866)，who created the theory of comazplete syste118 (vollstandige 
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Systeme ). These papers contained long analytic calculations with myriades of for- 
mulas. The wonderful geometric insight is mainly due to Sophus Lie. 


Theorem 3.3. Zef Di .Damr pe7n (mm <7) near dzzerential operatory 0 琢 " 
comrespondinzg lo Vector jields aH( 四 ,aml(y) amd suppose pat 1pese veclors 
Ga1e iea1ly Iadependet Jory 三 2V0. 不 


[Di Dil=0 Jor al (3.8) 
1je1 1je jpo111oge11eoUS SySte111 
Ji =0 Jor 三 |.. .7 


Possessey (10 0 11eig1pox1jpood of yo) 7 一 70 SO1U1io11s Jor whicpn 1pe gradie111s 
VP(Vo) ae inear1y idepenade1tk 
EUFtperao1e, 1pe io11o8ge11eousy SySte1a of DaiaL di1jerential eqguatio118 


Di 王 放 Jor 三 1 . .710 


Dossessey Q Da11iCULG1 SOLULiOP z0 G 11e18HDoU1pood of yo 矿 a1zad only 矿 癌 additiom 
1o (3.8) 1pe jzactionsy Pi( 力 ;Rnm(y) Satisjy te integrapilipy coaditio1zs 


Di 一 Dj1ji Jor GL1 2; 7. (3.9) 


Proof (a) Let V denote the space of vectors in 月” that are orthogonal to aD] (yo)， 
.alm(yo), and consider the (7 -mm)-dimensional manifold AM = yo 十 了 .We 
then extend an arbitrary Smooth function 天 : AMM 一 玉 to aneighbourhood of yo by 


Fl(e 风 ooppGo+) =F(o+ 吕 ， G.10) 


Notice that (三 tu) mV 三 pp 四 全 .opbi(yo 十 W defines a local diffeomor- 


phism between neighbourhoods of 0 and yo. Since the application of the operator 
Dam to (3.10) corresponds to a differentiation with respect to tm and the expression 
(e 风 oooh(y 十 U) isindependent oftm by (3.10), we get DmF(y) = 0 
To prove DiP(y) = 0fori < 7 we first have to change the order of the fows 人 ] 
in (3.10), which is permitted by Lemma II.5.4 and assumption (3.8), So that 有 18 
in the left-most position. 

(b) The necessity of (3.9) follows immediately from Dijpjy = DiDiE = 
DjDi 下 = 也) 六 .For given Pi satisfying (3.9) we define 严 (y) in a neighbourhood 
of yo (il.e., for small 六 .tn and small v) by 


tl 
PF(eg oogm(%+ 相 ) 一 人 和 (9 (yo 十 吕 ) 坟 
0 


tm 
+ i( 四 op 榴 oog 有 (+ 加) 邮 
0 
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and we prove that it is a solution of the System 乙己 三 ji for; 三 1 ...,70. Since 
only the last integral depends on tr，we immediately get by differentiation with 
TesSpect to tm that Dm 下 = 几 m. Eor the computation of Pi we differentiate with 
Tespect to 如. The first 7 一 1 integrals are independent of 误 . The derivative of the 
?ith integral gives Pi; (oo . 9 Oo (yo 十 2))， and the derivative of the remaining 
integrals gives 


刀 7 
小 ;J ( 虽 Ga 20(m 十 中 】 dl 三 1 万 7 ,9 20(m 十 吕 ) 直 
0 0 


= ji 人 .oo (yo 二 ) 一 (e 扩 1 本 十 吕 ) 














for7 = 十 1 ...,70.Summing up, this proves 万 下 三 用. 


VJII.3.3 Coordinate Changes and the Darboux-Lie Theorem 


The emphasis here is to Simplify a given 
Poisson Structure as much as possible by a 
coordinate transformation. We change from 
coordinates 2 ,or to 页 ( 作 ) 入 ( 切 
with continuously differentiable functions 
and an invertible Jacobian 4(y) = 9V/6y， 





Fig. 3.3. New coordinates in a Poisson System 


Jean Gaston Darboux3 


and we denote 下 ( 念 := 天 (and G( 念 := 

CG(V) (see Fig.3.3). The Poisson structure as well as the Poisson fow on one Space 
will become another Poisson Structure and fow on the other Space by Simply apply- 
ing the chain rule: 











oOF 念 9 次 ， 07 60G( 切 
0 让 一 一 (3.11) 
呈 人 1 DOW o 人 人) 0O1 0 


This ls another Polisson Structure with 


一 { 芒 ) or 了 人 力 =4(JB( 人 4(0)7 G.12) 


3 Jean Gaston Darboux, born: 14 August 1842 in Nimes (Francej, died: 23 February 1917 
in Paris. 
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The same structure matrix ls obtained ifthe Poisson System (2.12) 1s written in these 
new coordinates (Exercise 9). 二 

Since 4 is invertible, the structure matrices 已 and 已 have the Same rank. We 
now want to obtain the Simplest possible form for 万 . 


Theorem 3.4 (Darboux 1882, Lie 1888). Suppose 1Pat ipe 1aatrix 已 () dejizres 
QG Poisso1m Piacket a11d 18 or co1sta11t 101K 风 一 gg 三 270 1 Q0 71ei8ghpoxrpood of 
V0 E 下 ”7Tje1， tere exist jactio1s ( 四 (人 Qi,Qm(y)， and 
Ci Cr(y) satisPing 


{P 成 }=0 { 忆 ,Qi = 一 055 {P,C =0 
{@i = 65 {@iQi =0 {G@iC =0 (3.13) 


O11 0 11eig1hpotrpood or yo. 7T1e gfadie1ni of 万,Qi CE da1e 1inea11y indepe1de11b 
So 1jatg (已 (四 ,Qi(0),Ck()) comstitutes a Vocal change or coorelinates 加 
CQ11O11ICQL Jo7711. 


The functions C1(y),. ,Co(V) are called distizzgxisped .jznctioms (ausgezeich- 
nete Funktionen) by Lie. 


Proojf We follow Lie's original proof. Similar ideas, and the Same notation，are 
also present in Darboux'”s paper. The proof proceeds in Several Steps, Satisfying the 
conditions of (3.13), from one line to the next, by solving Systems of linear partial 
differential edquations. 

(a) 开 all bij(yo) = 0, the constant rank assumption implies bij(y) = 0ina 
neighbourhood of yo. We thus have mm = 0 and all coordinates Ci(Vy) = WwW are 
Casimirs. 

(b) 开 there exist 77 with bij(y0) 夭 0, we set Q1(0V) = Vi and we determine 
书 (y) as the solution of the linear partial differential equation 


{Q@- 玉 = 1 (3.14) 


Because of bij(yo) 天 0 the assumption of Theorem 3.3 is satisfied and this yields 
the existence of 亡 . We next conslider the homogeneous System 


{Q1P =0 and { 万 ,有 =0 (3.15) 


of partial differential equations. By Lemma 3.2 and (3.14) the Lie operators cor- 
responding to Cl and 疡 commnute, so that by Theorem 3.3 the System (3.13) has 
有 一 2independent solutions F3,...,A.Their gradients together with those of Cl 
and 疡 form abasis of 极 "”. We therefore can change coordinates from 71,. ,yn to 
QTL 万 , ,Er (mapping yo to yo). these coordinates the first two rows and 
the first two columns of the structure matrix 妃 (y) have the required form. 

(c) 开 1 (yo) =0foral27 >3,wehave7mm =1(Similartostep (a)) and the 
coordinates F3,...,En are Casimirs. 
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(d) If there exist1 > 3and7y > 3 with ( (加 ) 入 0, we set Q@a = 克 and we 
determine 户 from the inhomogeneous System 


{Q1 已 }=0， { 己 , 忆 } = 0， {@2, } = 1 


The inhomogeneities satisfy (3.9), and the Lie operators corresponding to QC1, 已 ， 
Q2 commnute (by Lemma 3.2). Theorem 3.3 proves the existence of such a 忆 . We 
then consider the homogeneous System 


{@u =0，{ 人 了 =0，{e@ 了 =0，{ 人 PE=0 


and apply once more Theorem 3.3. We get 见 一 4independent solutions，which 
we denote again 55,...,zn. Asin part (b) of the proof we get new coordinates 
QI 万 ,Ga 忆 , 65 ,Enfor which the first foxrrows and columns of the struc- 
ture matrix are canonical. 

(e) The proof now continues by repeating Steps (c) and (d) until the structure 
matrix has the desired form. 














Corollary 3.S (Casimir Functions). 710 11e sitzatiozz of 7TPpeore1a 3.4 加 e jzactio11y8 
Ci Cr) sa 及 


{Ci 万 }==0 Jorallsmootp 万 . (3.10) 


Pioof Theorem 3.4 states that VCi( 人 7B(N)V 瓦 (V) = 0, when 刀 (y) is one ofthe 
functions 已 ( 四 Qi or Cj( 四 However the gradients of these functions form a 
basis of 及 "”. Consequently VCi( 人 TB(V) = 0 and (3.16) is satisfied for all differ- 
entiable functions 达 (z)， 














This property implies that all Casimir functions are first integrals of (2.12) what- 
ever 万 (y) is. Consequently, (2.12) is (close to yo) a differential equation on the 
manifold 

A 人 ={yEUICiW = Constii1 三 |,70 (3.17) 


Corollary 3.6 (Transformation to Canonical Form). Demzote 如 e trams1jormaatiom 
ofTjeorem 34 思 2 一 8) = (已 (人 Qi,Ck(J)) WP is change ofcoordi- 
11atey, 1jpe Poissom syste1 0 三 已 (V)V 万 (人 pecomres 


J-1L 0 
必 = BoV 开 (2 Wit 0 一 0 0 (3.18) 


Wjpee 开 (2) 三 古 (). Writingz = (Pdqc), ti syste1a pecom1ey 
六 = 一代 v(D, 0,c)， d= 开 pn(D,9,c)， c=0. 
Prooj The transformed differential equation is 


之 一 水 (BY VE) with yy = 人 (2 














and Theorem 3.4 states that y%(y)B( 人 JU) = 0. 
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VIIL.4 Poisson Integrators 


Before discussing geometric numerical integrators，we Show that many important 
properties of Hamiltonian Systems in canonical form remain valid for Systemas 


= BO)V(y)， (4.1) 


Where 妃 (V) represents a Poisson bracket. 


VJII.4.1 Poisson Maps and Symplectic Maps 


We have already seen that the Hamiltonian 妃 (y) is a first integral of (4.1). We shall 
Show here that the fow of (4.1) Satisfies a property closely related to Symplecticity. 


Definition 4.1. A transformation  : ZL 一 民 "” (where V is an open set in 慌 "”) is 
called a Poissom 1aP with respect to the bracket (2.8), if its Jacobian matrix Satisfies 


2(JB()w(O7 = B(e(). (4.2) 


Anequivalent condition is that for all smooth real-valued functions 忆 , G defined 
on 2(D)， 
to2GopO={GJe(O))， (4.3) 


as is Seen by the chain rule and choosing 忆 ,G as the coordinate functions. It is 
clear from this condition that the composition of Poisson maps is again a Poisson 
map. A comparison with (3.12) Shows that Poisson maps leave the Structure matriX 
invariant. 

For the canonical symplectic structure, where 妃 (y) = .1 condition (4.2) is 
equivalent to the symplecticity of the transformation (V). This can be seen by takc- 
ing the inverse of both Sides of (4.2), and by multiplying the resulting equation with 
o2'(y) from the right and with 2'( 四 7 from the left. Also in the situation of a Hamil- 
tonian System (2.17) on a symplectic submanifold A4, where 刀 (y) is the structure 
matrix of the differential equation in coordinates y as in Theorem 2.8，condition 
(4.2) ls equivalent to Symplecticity in the Sense of preserving the Symplectic two- 
form (2.16) on the tangent Space, as in (1.10): 


Definition 4.2. A map V : 人 4 一 人 M on asSymplectic manifold M is called symz- 
Plectic 过 for every Z E 人 41， 


ww(W (ze (ze) = wz(ceo) forall 上 ce 友人 4. (4.4) 


Anear-identity map 纱 : 人 4 一 人 fis symplectic iand only ifthe conjugate map 
o%inlocal coordinatesZ = X(y), with o(Vy) givenby VW(zZ) = X(o(W))forz = X(y)， 
is a Poisson map for the structure matrix of (2.21), B(y) = (X()7 ITX(O))-1L with 
X() = X( 轨 . This holds because W(z)E = X(oly))p7 forz = X(V) and 
E=X(V)7, and because (4.2) is equivalent to wo) X(o()E ITX(e(J)e2(y) = 
X(J)7 7TX(y) 
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Theorem 4.3. 矿 B(V) 5 力 e Structure iatrix ofa Poisson prackekh 妨 en tjpe Jom 
oz(o) oftpe djerential equation (4.1)Ma Poiyson 10aD. 


Poof (a) For B(y) = ./: this is exactly the statement of Theorem VI.2.4 on the 
Symplecticity of the fow of Hamiltonian Systems. This result can be extended in a 
straightforward way to the matrix Bo of (3.18). 

(b) For the general case consider the change of coordinates > = V(y) which 
transforms (4.1) to canonical form (Theorem 3.4), ie., 9 ( 力 B()2( = Bo and 
= BoVK(z) with 天 (2) = 五 (V) (Corollary 3.6). Denoting the flows of (4.1) and 
冯 = BoVK(z) by pi(y) and Wi(2)j, respectively we have Wi(2(0)) = 2(er(y) 
and by the chain rule Wi (oO( 人 )W (0 = 巡 (P(y))2i) .Inserting this relation into 
好 (z)BowWi(z)7 = Bo, which follows from (a), proves the statement. 

Adirect proof, avoiding the use of Theorem 3.4, is indicated in Exercise 0. 














From Theorems 2.8 and 4.3 and the remark after Definition 4.2 we note the 
following. 


Corollary 4.4. 771e Jow ora Parailiomia1z syste1a (2.77) o11 Ga Sy1ID1ectic SUD1IG1- 
Jo1d is Sy11Plectrc. 


The inverse of Theorem 4.3 is also true. It extends Theorem VI.2.0 from canon- 
ically Symplectic transformations to Poisson maps. 


Theorem 4.S. Ze FIV) and 已 () pe comtizauotsiy Qierentiaple om da1z opem set 
DC 下 由，C1d aSsU11e 1Pat 已 () Jeprese1ts Q Poisso1z DJracket (Dejizaifiom 2. 逢 . 
7Tjez 0 = oocaly oftpejora (4 由 太 anad oniy 太 


e its Jiow pi(] respects pe Casinairs of (四 ELe Ci(pi( 人 ) = Consb amd 
e@ ilsjiomwiaPoisson 11ap Jorally EL andjorall suficiemnty s12011 


Prooj The necessity follows from Corollary 3.3 and from Theorem 4.3. For the 
proof of sufficiency we apply the change of coordinates (uc) = 9(V) of Theo- 
rem 3.4, which transforms 巨 (y) into canonical form (3.18). We write the differential 
equation y = /yy) in the new variables as 


也 三 g( c)， C 王 (uc). (4.3) 


Our first assumption expresses the fact that the Casimirs, which are the components 
of c, are first integrals of this system. Consequently，we have A(u,c) 三 0. The 
Second assumption implies that the How of (4.5) is a Poisson map for Bo of (3.18). 
Writing down explicitly the blocks of condition (4.2), we see that this ls equivalent 
to the symplecticity of the mapping wo 上 人 坟 uo,co)，with co as a parameter. 
From Theorem VI.2.6 we thus obtain the existence of a function 天 (w,c) such that 
gc) = JIVwK(u ec).Noticethat forflows depending smoothly on aparameter， 
the Hamiltonian also depends smoothly on it. Consequently, the vector field (4.3) 
is of the form BoVK(u, c). Transforming back to the original variables we obtain 
jy) = B(J)VY(y) with 五 (y) = 开 (9(y)) (see Corollary 3.6). 
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VJII.4.2 Poisson Integrators 


The preceding theorem Shows that “beingaPoisson map and respecting the Casimirs” 
is characteristic for the fow of a Poisson System. This motivates the following def- 
nition. 


Definition 4.6. A_ numerical method li = 21p(Vyo) is a Poissom integrator for the 
Structure matrix 妃 ( 轨 ,让 the transformation yo mm V1 Tespects the Casimirs and 让 it 
is a Poisson map whenever the method is applied to (4.1). 


Observe that for a Poisson integrator one has to Specify the class of Structure 
matrices 妃 ( 力 . A method will never be a Poisson integrator for all possible 妃 ( 攻 . 


了 xample 4.7. The symplectic Euler method reads 
&m 二 L 一 7 十 PanTIon 万 (nl Un )， Vmn+L 一 UVmn 一 jn 二 IUn 古 (nl um) 


for the Lotka-Volterra problem (2.13). It produces an excellent long-time behaviour 
(Fig.4.1,left picture). We Shall show that this is a Poisson integrator for all separable 
Hamiltonians 妃 (v) = 开 ( 轨 十 忆 (u). Forthis we compute the Jacobian ofthe map 
(nm， un) FE (n+l， Un+1)， 
1 一 AU 万， 1 人 (2 E | 
jUn (人事 ,十 Un ) 1 Dun un) NO 1 一 Pulpn+l 瑟 v 
(the argument of the partial derivatives of is (w+li,un) everywhere)，and we 
check in a straightforward fashion the vallidity of (4.2). A different proof，using 
differential forms, is given in Sanz-Serna (1994) for a Special choice of 克 (让 ). 
Similarly, the adjoint of the Symplectic Euler method is a Poisson integrator, and 
SO is their composition - the Stormer-Verlet Scheme. Composition methods based 
on this Scheme yield high order Poisson integrators，because the composition of 
Poisson maps is again a Poisson map. 
The implicit midpoint rule, though symplectic when applied to canonical Hamil- 
tonian Systems, turns out not to be a Poisson map for the structure matrix 肆 (uv) of 


(2.13). Figure 4.1 (right picture) shows that the numerical solution does not remain 
near a closed curve. 


Itis adiffhicult task to construct Poisson integrators for general Poisson Systems; 
cf the overview by Karasozen (2004). First of all, for non-constant 妃 (y) condi- 
tion (4.2) ls no longer a quadratic first integral of the problem augmented by its 
variational equation (See Sect. VI.4.1). Secondly, the Casimir functions can be ar- 
bitrary and we know that only linear and quadratic first integrals can be conserved 
automatically (Chap.IV). Therefore, Poisson integrators will have to exploit Special 
Structures of the particular problem. 


Splitting Methods. Consider a (general) Poisson system y = 已 (J)V 瓦 (V) and 
Suppose that the Hamiltonian permits a decomposition as 互 (y) = 瓦 H (V) 十 .十 
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本 


Sympl. Eule 


TARRRAS ROSS 
Verlet Jr impl. midpoint 














Fig. 4.1. Numerical Solutions of the Lotka-Volterra equations (2.13) (Step Size 几 三 0.25， 
which is very large compared to the period of the solution; 1000 steps; initial values (4, 2) 
and (6,2) for all methods) 


万 [四 (四 ,such that the individual systems 六 = B(W)VEG() can be solved ex- 
actly. The fow of these Subsystems ls a Poisson map and automatically respects 
the Casimirs, and So does their composition. McLachlan (1993), Reich (1993), and 
McLachlan & Quispel (2002) present Several interesting examples. 


了 xample 4.8. Im the previous example of a Lotka-Volterra equation with separable 
Hamiltonian 达 (wv) = 开 (w) 十 二 (wj, the systems with Hamiltonian 天 (w) and 
了 (u) can be solved explicitly. Since the fow of each of the subsystems is a Poisson 
map,，so jls their composition. Combining a half-step with Za full step with 及， 
and again a half-step with 元 , we thus obtain the following Verlet-like second-order 
Poisson integrator: 





必 
n+1/2 一 exp(5 onVZ(on))un 
Vn+1l 二 exp( 一 pun+l2VK(uwn+i/2))on (4.0) 
必 
Un+l 一 exp(5 UnHIVZL(un+i))un+l/2. 


In the setting of Hamiltonian Systems on a manifold, the Splitting approach can 
be formulated in the following way. 


Variational Splitting. Consider a Hamiltonian system (2.17) on a symplectic man- 
ifold AM, and use a splitting 瑟 = 五 上 十 五 P] of the Hamiltonian in the following 
algorithm: 


1. Let zz 二 E 人 be the solution at time 岂 /2 of the equation for z， 
(Jj 之 -VDEDU(z)S=0 foral ce 人 (4.7) 


with initial value Z(0) 王 Zn. 
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2. Let Z 7 


milibethe solution at time 丸 of 


(JZz -VED(m,6=0 foral ee 人 (4.8) 


with initial value z(0) = z 才 . 
3. Take zn+li as the solution at time P/2 of (4.7) with initial value Z(0) 一 ZmH1. 


Splitting algorithms for Hamiltonian Systems on manifolds have been studied by 
Dullweber，Leimkuhler & McLachlan (1997) and Benettin，Cherubini 包 Fasso 
(2001) in the context of riglid body dynamics; see Sect. VI.5. Lubich (2004) and 
FEaou & Lubich (2004) have studied the above splitting method for applications in 
quantum molecular dynamics; See 9ect. VJII.6 for an example. 

By Theorem 2.8, the substeps 1.-3. written in coordinates 2 = X(V) are Poisson 
Systems 1 一 妃 ( 力 Y 天 国 (y) with 开国 (y) = 万 加 (x(), but the algorithm itself 
is Independent of the choice of coordinates. Since the Substeps are exact fows of 
Hamiltonian systems on the manifold A4, their composition yields a Symplectic 
map. In the coordinates V the Substeps are the exact Hows of Poisson Systems, and 
hence their composition yields a Polisson map. 


Poisson Integrators and Symplectic Integrators. Generally we note the follow- 
ing correspondence, which rephrases the remark on Symplectic maps and Poisson 
maps after Definition 4.2. It applies in particular to the Symplectic integrators for 
constrained mechanics of Sect. YII.1. 


Lemma 4.9. 47 jztegrator 2Z1 三 胱 (Z0) Jor a Damailtonian syste1a (2.77) om 0 
1IG1111o1d A4 1 Sy1iDplectic 矿 a1ad opy 矿 11e itegrator Written 0 1ocal coo1dijzatey， 
41 一 2Bp(yo) correspozadizzg 加 da coo1dizzate Jap Z 一 X( ia Poisson integ1ator 
Jr tpe Structure 1atrix 妃 (y) off2.27 


VJIL.4.3 Integrators Based on the Darboux-Lie Theorem 


If we explicitly know a transformation > = V(y) that brings the system 7 = 
妃 (J)V(y) to canonical form (as in Corollary 3.6)，we can proceed as fol- 
lows: compnute zu = (rn); apply a symplectic integrator to the transformed sys- 
tem 2 = BoVK(z) (Bo is the matrix (3.18) and 开 (z) = 瑟 ()) which yields 
2141 一 胸 (2);compute fnally yi from 2n+1 三 91i).This yields aPoisson 
integrator by the following lemma. 


Lemma 4.10. Zet z = (cc) = QUV) pe ie transjprmaation of 7Tjheore1a 了 4. SUD- 
Pose 1pat 加 e integ1ator (V) 1GKes 加 e Jo171 


玩 (z) = 人 】 


C 


访 1e Hew variaples z = (cj. 7T1e1 到 (V) 1G Poissom iteg1ator 太 a1zzd omly 太 
凡 呈 (ic) 入 GSy1plectic inategrator oreveryc. 
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Pioof The integrator Zn(V) is Poisson for the structure matrix 巨 (y) 让 and only 让 
琴 (z) i Poisson for the matrix Bo of (3.18); see Exercise 7. By assumption, 万 (2 
preserves the Casimirs of 刀 o. The identity 


4.J-147 0 
史 mwr=( 2404 9 





with 4 = 90 呈 /0u proves the statement. 











Notice that the transformation 如 has to be global in the sense that it has to be 
the Same for all integration steps. Otherwise a degradation in performance, Similar 
to that of the experiment in Example V.4.3, has to be expected. 


了 xample 4.11. As a first illustration consider the Lotka-Volterra System (2.13). 
Applying the transformation V(wv) = (ntu jnvo) = (Pdq), this System becomes 
canonically 了 amiltonian with 


天 (D,9) = 一 瑟 (uu) = 一 囊 (ez,e?). 


开 we apply the symplectic Euler method to this Hamiltonian System，and 过 we 
transform back to the original variables, we obtain the method 


mn 二 1 一 un exp(jon 本 (un+iyon))， 


(4.9) 
Un 一 Un exp( 一 pun+iEuw(un+l un)) 





In contrast to the method of Example 4.7, (4.9) is also a Poisson integrator for (2.13) 
这 万 (wu) is not separable. If we compose a step with step size 丈 /2 of the symplec- 
tic Euler method with its adjoint method, then we obtain again, in the case of a 
Separable Hamiltonian, the method (4.0). 


了 上 xample 4.12 (Ablowitz-Ladik Discrete Nonlinear Schrodinger 上 quation). 
An interesting Space discretization of the nonlinear Schrodinger equation ls the 
Ablowitz-Ladik model 


212 十 二 (ta 一 2 大 十 钛 -十 | 多 (二 和 -1 = 0， 


Which we consider under periodic boundary conditions wk+N = Wi (Az = 1/N). 
It is completely integrable (Ablowitz-Ladik 1976) and, as we Shall see below, it is a 
Poisson System with noncanonical Poisson bracket. Splitting the variables into real 
and imnaginary parts, 人 三 人 十 2UK，We obtain 





大 = 一 二 (opH 一 2ok 十 -一 ( 吕 十 品 )(ost 二 wk 
bi 一 二 (ut 一 2 驮 十 WU _1) 十 (如 十 吕 )(wk+i 十 WE 1). 


Withv = (UN), uv 一 (ul ,UN) this system can be wiitten as 
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RN 


where 万 = diag(dl,...,dNjis the diagonal matrix with entries 


du) 一 1TTAz2( 人 十 o)， 
and the Hamiltonlian is 


N 
工 
瑟 (u) 三 5 > (wu 十 uro-ij 一 艺 一 一 一 3 (L+Az2(0w 十 邮 )). 
1=1 


We thus get a Poisson System (the conditions of Lemma 2.3 are directly verified). 
There are many possibilities to transform this System to canonical form. Tang， 
Perez-Garcfa & V 和 izquez (1997) propose the transformation 


1 ( Az ) 

Mr 三 一 一 一 arctanl 一 一 0 )， dF 一 VR) 
AzVI1T 二 Az2v V1I 十 Az2vz 

for which the inverse can be computed in a Straightforward way. Here,， we Suggest 

the transformation 


了 =UpotAz (十 鹃 ) In(l 
人 和 0) with (lz) = ee (4.11) 
大 二 Wo(Az2( 人 十 W)) 和 
which treats the variables more Symmetrically. Its inverse ls 
UK 二 DAK T(Az2(p2 十 08)) expZ 一 工 
和 with T(Z) = 一 一. 
UK 二 4dK T(Az (ZK 十 92)) 风 
Both transformations take the System (4.10) to canonical form. For the transforma- 
tion (4.11) the Hamiltonian in the new varliables 1s 


PP 志 r0 (2 十 ))r(Az 2(p2 1 十 0 1)) (ppri 十 wo ) 
2 | 
一 2 人 7 十 多) 
Applying standard Symplectic schemes to this Hamiltonian yields Poisson integra- 
tors for (4.10). 


VIH.S Rigid Body Dynamics and Lie-Poisson Systems 


.… these topics, which, after all, have occupied workers in geometric me- 
chanics for many years. (R. McLachlan 2003) 


An Important Poisson System is given by Euler's famous equations for the mo- 
tion of a riglid body (see left picture of Fig. 3.1), for which we recall the history and 
derivation and present various Structure-preserving integrators. Euler s equations are 
aparticular case of Lie-Poisson Systems,， which result from a reduction process of 
Hamiltonian Systems on a Lie group. 
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VJII.S.1 History of the Euler Equations 


“TILe sujet que je me propose de traiter ici, est de la derniere importance 
dans la MEcanique ; 廊 j ai d6ja fait plusieurs efforts pour le mettre dans 
tout Son jour. Mais, quoique le calcul ait asses bien Teussj, 儿 que j ai 
de6couvert des formules analytiques .…, leur application ktoit pourtant as- 
sujettie a des difficultks qui mont paru presque tout a fait insurmontables. 
Or depuis que j ai dkvelopk les principes de la conoissance meEcanique 
des corps, la belle proprikte des trois axes principaux dont chaque corps 
est douE, ma enfin mis en ktat de vaincre toutes ces difficultEs，..2” 
(Euler 1758b, p. 154) 


A great challenge for Euler were his efforts to establish a mathematical analysis for 
the motion of a riglid body. Due to the fact that such a body can have an arbitrary 
shape and mass distribution (See left picture of Fig.3.2), and that the rotation axis 
can arbitrarily move with tme, the problem is dif8icult and Euler struggled for many 
years (all these articles are collected in Opeya O1az1lia, Ser. JI Vols.7 and 8). The 
breakthrough was enabled by the discovery that any body, as complicated as may be 
its configuration, reduces to an inertia ellijpsoid with three Principal axes and three 
numbers, the principal moments of inertia (Euler 1758a; see the middle picture of 


Fig.3.2 and the citation). 
cx1P 一 AD 一 9 一 (全 7 


,1a duatribme et la Si 





ec 一 2O 2 on aura pareillement 
0 


21' 一 BI' 一 下 六 一 下 六; 





ad 一 ce AKCZY 
0 0 xxdt 一 -MO me, la cinduibme et la 
“，0 0n aUTa 
2 一 44 了 25 及 zt 7 
dx 于 FS 的 0 Meccec x&1 一 C7 一 GO 一 FI5; 


Fig. $.1. Left picture: frst publication of the Euler equations in Euler (1758b). Right picture: 
Principal axes as elgenvectors in Lagrange (1788) 


The Inertia Ellipsoid. We choose a moving coordinate System connected to the 
body 2 and we consider motions of the body where the origin is fxed. By another 
of Euler's famous theorems, any Such motion ls infinitesimally a rotation around an 
axXlS. We represent the rotation axis of the body by the directioz of a vector w and 
the Speed of rotation by the /eg 态 of w. Then the velocity of a mass point Z of 好 is 
given by the exterior product 


Cj2Y3 一 3Y2 0 一 3 CO2 尼 1 
光一 wxXxZ=| waszl 一 wl7z3a | 王 3 0 ”一 w1 22 (3.1) 
CI1Y2 一 W21 一 2 CI1 0 3 


(orthogonal to w, orthogonal to z, and of lengh ||w|| :| zll :sin 7y; see the left picture 
of Fig.3.2). The inetic ee18y 1 obtained by integrating the energy of the mass 
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Fig. S.2. A rigid body rotating around a variable axis (lefb; the corresponding inertia ellijpsoid 
(middle); the corresponding angular momentum (rightb) 


points dm over the body 


罗 全 扩 可 xzPmm (5.2) 
好 


和 
2 
1 


二 / (Crs 一 wszo) + (wszl 一 wlz3) 十 (wlizs 一 wozi)?】 am . 
好 


Hthis is multiplied out, one obtains 


太一 5w7Gw， Where Oii 本 上 


(z2 十 Z2) dm2，Gi 一 -上 Zi215 0712，( 天 大人. 
好 好 


(5.3) 
Euler (1758a) Showed, by endless trigonometric transformations,，that there exist 
Principal axes of the body in which this expression takes the form 





太一 (五 F oo2 1 五 轨 ) (5.4) 


This was historically the first transformation of such a3x3 quadratic form to diago- 
nal form. Later Lagrange (1788) discovered that these axes were the eligenvectors of 
the matrix O and the moments of inertia 大 the corresponding eigenvalues (without 
calling them So, See the right picture of Fig.5.1). 


The Angular Momentum. The first ljaw of Newton's Primcipia states that the 11o- 
1Ie11U1 .70 of a mass point remains constant in the absence of exterior forces，. 
The corresponding quantity for 7otatiozal motion is the a128VLQ7 112O11e1212112，1.e.， 
the exterior product z x vtimes the masS. Integrating over the body we obtain, with 


(3.1)， 
= exam= /exexa)tm (3.5) 
好 好 


Ithis is multiplied out, the matrix O appears again and one obtains the Surprising 
result (due to Poinsot 1834) 
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4 三口 w， or in the Principal axes coordinates， 人 三 有 ck (3.0) 


Such arelation is familiar from the theory of conjugate diameters (Apollonius, Book 
JI Prop. VD: the angular momentum is a vector orthogonal to the plane of vectors 
conjugate to w (See the Tight picture of Fig. 3.2). 


The Euler Equations. Eulers paper (1758a), on his discovery of the Principal axes， 
is immediately followed by Euler (1758b)，where he derives his equations for the 
motion of a Tigld body by long, doubtful and often criticized calculations, repeated 
in alittle less doubtful manner in Eulers monumental treatise (1765). Beauty and 
elegance, not only of the result, but also of the proof, is due to Poinsot (1834) and 
Hayward (18306). It is masterly described by 上 Klein & Sommerfeld (1897), and in 
Chapter 6 of Arnold (1989). 
From now on we choose the coordinate System, moving with the body, Such that 
the inertia tensor remains diagonal. We also watch the motion of the body from a 
coordinate System stationary in the space. The transformation of a vector z E 琢 3 in 
the body frame 4, to the corresponding 世 E 月 3 in the stationary frame, is denoted 
by 
=QGOUb7， (3.7) 


The matrix @C(b is orthogonal and describes the motion of the body: for z = ei we 
See that the columns of C 人 (为 are the coordinates of the body's principal axes in the 
stationary frame. 

The analogous statement to Newton's first law for rotational motion 1S: 态 如 e 
GDSe1ice of exierior da18Uar Jorcey 1je da118La7 12O11e1a1a111 人 yee1l 访 oz tpe jixed 
coorelinrate syste12 18 ad co1ls1a1t veclor 5. This same vector W, seen from the movimg 
frame,， which at any instance rotates with the body around the vector w, rotates im 
the opposite direction. Therefore we have from (3.1), by changing the Signs of w， 
the derivatives 


21 0 Ww3 一 w2 41 
|=|1-wa 0 Cl 2 | . (5.8) 
13 w2 一 wl 0 23 


If we insert wx = /大 from (3$.6), we obtain 


纪 0 oj/R -oo/RPN 1 Us 一 五 )9 
蚁 | = | 一色 / 厂 0 几 / 厂 吧 | =| (0 一 万 Wo 
久 o/2 0 1 多 (2 一 五 )g2 纺 
(5.9) 
or by rearranging the products the other way round， 
信 0 -内 W1/ 厂 
加 | =| 0 一 纹 22/TP | ， (5.10) 
2a -2 吨 0 ya/73 


“4Long-standing tradition, from Klein to Arnold, uses capitals for denoting the coordinates 
in this moving frame; but this would lead to confusion with our subsequent matrix notation 
5 For a proof of this statement by d'Alembert's Principle, see Sommerfeld (1942), 8IIL.13. 
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WwWrIitten in two different ways as a Poisson System，whose right hand vectors are 
the gradients of C(y) 三 3 2 and 万 (y) 王 且 汉 2 及 192，respectively. 
These are the two quadratic invariants of Chap.IV. The first represents the length 
of the constant angular momentum V in the orthogonal body frame, and the second 


represents the energy (9.4). 


Computation of the Position Matrix Q (如 . Once we have solved the Euler equa- 
tions for y( 轨 ,we obtain the rotation vector w( 加 by (5.6). It remains to find the ma- 
trix @( 轴 which gives the position of our rotating body. We know that the columns 
of the matrix @, seen in the stationary frame, correspond to the unit vectors ei in the 
body frame. These rotate, by (3.1), with the velocity 


0 一 3 CO2 
(@xeboxeanowxea) 一 3 0 一 wl | =: 帮 . (S$.11) 


We thus obtain Q， the rotational velocity expressed in the stationary frame, by the 
back transformation (5.7): 


Q@=om or QTro= 克 . (5.12) 


This is a differential system for CO which, because 玫 is Skew-Ssymmetric, preSserVes 
the orthogonality of &%. The problem is solved - in theory. 


VIH.S.2 Hamiltonian Formulation of Rigid Body Motion 
In order to open the door for effcient numerical algorithms,， we treat the riglid body 
as a constrained Hamiltonian System. 


Position Variables. The position of the rigid body at time t is determined, in view 
of ($.7), by a three-dimensional orthogonal matrix @( 轨 . The constraints to be re- 
spected are thus QQ -了 工 = 0. 


Kinetic Energy，、 As in (5.12)，we associate with C and Q the Skew-Symmetric 
matrix 三 = QQ whose entries wk, arranged as in (3.11), determine the kinetic 
energy by (3.4): 


工 
了 = 了 aiwi + Po2 十 有 wu3) 
For any diagonal matrix 万 = diag(dl, qd2,qs) we observe 
trace (全 厂 全 站 = (da 二 da)oz 上 + (ds +d)o 二 (di +da)o3. 


Therefore, with 


五 三 do2 十 ds， 1 =ds 十 al， 13 一 dl 十 qd OT 从 = 人 呈 dm (9.13) 
好 
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(note that dx > 0 for all bodies that have interior points)，we obtain the kinetic 
energy as 


全 二 了 trace (DT)， (5.14) 


JInserting 佣 三 QITQ, wehave 


全 一 了 trace (QTQODoxQ) = 3 trace (QDQT) ， (S.15) 


Since CQ is an orthogonal matrix， 


Conjugate Variables. We now have an expression for the kinetic energy in terms of 
derivatives of position coordinates and are able to introduce the conjugate momenta 


忆 = 97/60 = CD. (5.16) 
If we suppose to have, in addition to , a potential 77(Q), we get the Hamiltonian 
F(P.g) = 3 trace (PD-LP7) +U(Q). (5.17) 


Lagrange Mujltipliers. The constraints are given by the orthogonality of C, i.e., the 
equation g(@) = Q1Q-T= 0.Sincethis matrix is always symmetric, this consists 
of 二 (7 十 1) = 6independent algebraic conditions, calling for six Lagrange multi- 
pliers. If the expression G(Q)7L 和 in (1.9) is actually computed, it turns out that this 
term becomes the product Q@4, where the six Lagrange multipliers are arranged in a 
symmetric matrix 4; see also formula (TIV.9.6). Thus, the constrained Hamiltonian 
System (1.9) reads in our case, with VL == (9U/6Qij)， 


Q@ = PDr-i 
己 = -VU(Q)-Q4 (Csymmetric) (5.18) 
0 = QQ 开 
Reduction to the Euler Equations. The key idea is to introduce the matrix 
0 一 dws dswz 
Y=QIP=QroD=D=| ds 0 -dawl | ， (5.19) 
一 Qlwo2 do2wl 0 


where the wx can be further expressed in terms of the angular momenta 人 三 大 wk 
Using the notation skew (4) = 地 (4 一 47), we see, with (5.13), that 


0 一 办 镶 
7 一 了 ==2skew(7Y)=| 四 0 玉 (35.20) 
到 急 0 


contains just the angular momenta. Moreover，DY is Skew-symmetric. By (5.18) 
the derivative of Y is Seen to be 
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Y=QIP+QP=D-IPIP-QIVU(QG)-4= DrI77Y-QIVYI(Q) 一 4. 


Taking the skew-symmetric part of this equation, the symmetric matrix 4 drops out 
and we obtain 


skew (7) = skew(D-IY77Y) - skew(QIVU(Q))， (3.21) 


These are, for L = 0, precisely the above Euler equations, obtained a Second time. 


VJIL.S.3 Rigid Body Integrators 


For a numerical Simulation of rigid body motions，one can either Solve the con- 
strained Hamiltonian System (3.18), or one can Solve the differential equation (9$.21) 
for the angular momentum 7( 芭 in tandem with the equation (3$.12) for @C( 轨 . We 
consider the following approaches: (ID an effticient application of the RATITLE algo- 
rithm (1.20), and (JD various splitting methods. 


(D RATITILE 


We apply the symplectic RATITLE algorithm (1.26) to the System (3$.18), and rewrite 

the formulas in terms of the variables Y and Q. This approach has been proposed 

and developed independently by McLachlan 儿 Scovel (1995) and Reich (1994). 
An application of the RAITLE algorithm (1.26) to the System (3.18) yields 


几 用 
Pa = 三 20 一 5VU(@o) 三 God4o 


Q1 Q@o 十 PPiyaD-1， QTQ@i = 工 (5.22) 
玉 玉 二 全 
中 = Ra-avU(@J- zi QTPRD TD Qi=0， 


where both 4o and 41 are symmetric matrices. We let Y0 三 QI Pu， 到 = Qf 书 ， 
and 2 = QI 书 j2D-I. We mnultiply the first relation of (3.22) by @Q0 ,the last 
relation by CQ , and we eliminate the symmetric matrices 4o and 41 by taking the 
skew-symmetric parts of the resulting equations. The orthogonality of QI Qi = 
7 十 ]2 implies 1272 = -(2 + 2 ), which can then be used to simplify the last 
relation. Altogether this results in the following algorithm . 


Algorithm S.1. Zef Qo pe ortjogo1al ad DYo pe SKew-sy1a111etric，O1e SteP 
(@o, 30) 一 (Qi 玉 ) oftpe etpod tpen yeads as jllows: 


-Jid 0 SUcj 11at7 十 凡 ON ortogomal da1d 
Skew (2DD) = Skew (20) 一 2 skew ( 1YT(Qo))， (3.23) 


- co1iptie QI1 = Qo( 十 RON)， 
一 CO11DUie 2 SUCH Iat 万 说 1 SKew-Sy11111e1ric G11G 


skew (六 ) = skew(2D)- skew(( 2 +2I)D) 一 2 skew (QT YU(Q1)). 
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The second step is explicit, and the third step represents a linear equation for the 
elements of 六 . 


Computation of the First Step. We write for the known part of equation (5.23) 


记 0 FS (你 
skew (2) 一 本 Skew (QI YU(Qo)) = | aa 0 -al | =4 (5.24) 
一 Q2 CT 0 


and have to Solve 
5(ZD -DZ0I=4，(U+HZDU+HAZ) = 了 3(ZD+ DZ7) =5 


(the trick was to add the last equation with 9 an unknown Symmetric matrix). Elim- 
ination gives 了 = (4+ 9)D-land27 = Dr-1(9 - 4). Both inserted into the 
Second equation lead to a Riccati equation for 9. There exist efficient algorithms 
for Such problems; see the reference in Sect, IV.3.3 and a detailed explanation in 
McLachlan & Zanna (2005). 


Remark S$.2 (Moser-Veselov Algorithm). An independent access to the above 
formulas ls given in a remarkable paper by Moser 儿 Yeselov (1991), by treating 
the rigid body through a discretized variational Principle,， Similar to the ideas of 
Sect. YI.0.2. The equivalence is explained by McLachlan 多 Zanna (2005), follow- 
ing asuggestion of B. Leimkuhler and S. Reich. 


Quaternions (Euler Parameters)， An efficient implementation of the above algo- 
Tithm requires Sujtable representations of orthogonal matrices, and the use of quater- 
nions ls a standard approach. 

After having revolutionized Lagrangian mechanics (See Chapt.VJD)，Hamilton 
struggled for years to generalize complex analysis to three dimensions. He fnally 
achieved his dream, however not in three dimensions, but in jovx7, and founded in 
1843 the theory of quaternions. 

For an introduction to quaternions (whose coefficients are Sometimes called 
Euler parameters) we refer to Sects.IV2 and IV.3 of Klein (1908), and for their 
use in numerical Simulations to Sects.9.3 and 11.3 of Haug (1989). Quaternions 
can be written as e = eo 十 Yel 十 ITe2z 十 Keas, where multiplication is defined via the 
Telations 这 一 关 二 了 二 一 条 二 放 庆 一 克 大 一 力 and 到 一 一 克 7 一 一 让 








和 一 一 7. The product of two quaternions e. 三 (written in matrix notation) is 
60 3 Jj0 
(eo 十 2e1 十 ITe2 十 Ke3) 四 万 (5 25) 
(十 访 十 7 户 十 K 方 ) 户 
j3 





We see (in grey) that in dimensions 1,2,3 appears a Skew-Symmetric matrix 耳 
whose Structure ls familiar to us. This part of the matrix changes Sign 夺 the two 
factors are permuted. 
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An important discovery, for three dimensional applications of the quaternions， 
is the following: 让 a quaternion Dp is a 3-vector (i.e., has po0 一 0), then 2 一 e.D.Eis 
a 3-vector too, and the map p 一 D/ is described by the matrix 





0 一 CE3 CE2 
Q(e) = 已 一 e3 0 ”一 el (3.20) 
人 CE1 0 


whereE 一 eo 一 iel 一 Jez 一 Kesandllel?=e.E=ei 十 时 十 e2 十 e3. 


Lemma S$.3. 矿 | 上 ell = 1 ten te matrixr QUe) 1 ortpogonaL Every ortpogozal 
1a11ix wii det@ 三 1camn pe written 太 1 友 ij WaveQlele( 六 =Qle 门 so 
11a1 1jpe 1IULDLication of ortpogo11al 11G1rices coFrespo1ds 1o 1Pe 11IULPDLiCcatio1z OF 
QUG1e1727O118. 

Geome1ricaliy tpe aatrir Q efjects ayotation around ipe axi es = (eliea)e3) 
Wi 7otation angle D Wicp safisjies tan(P/2) = |sllyeo， 


Proof The condition QILQ = 7 can be verified directly using BE = 一 已 and 
五 3 = 一 (ez 十 e2 十 e 扫 媚 . The reciprocal statement is a famous theorem of Euler; 让 
is based on the fact that s is an eigenvector of Q, which in dimension 3 x 3 always 
exists. The formula for Q(eJQ(jJ) follows from e. 户 D. 六 E = (e. 帮 .D.(e. 帮 . 

The geometric property follows from the 
Virtues of the exterior product，because by @。 
(S$.1) the matrix C maps a vector Z to 


2 十 2eosxzZ 二 2Esx(ex2z)， 


This consists in a rectangular mouvement im 
a plane orthogonal to s; first vertical to Z by 
an amount 2eo ||s|| (times the distance of z)， 
then parallel to z by an amount 2||<||?. 
Applying Pythagoras” Theorem as (2eo |sl)2 二 (2zl 上 2 一 12 = 1,itturns out 
that the map is norm preserving if ed 十 |zl2 = 1. The angle 2/2, whose tangens can 
be seen to be ||sll/eo, is an angle at the circumference of the circle for the rotation 
angle op at the center. 

















For an efficient Implementation of Algorithm 3$.1 we represent the orthogonal 
matrices Co Qi, and 了 十 2 by quaternions, This reduces the dimension of the 
Systems, and step 2 becomes a Simple multiplication of quaternions. For Solving 
the nonlinear System of step 1, we let 了 十 12 = Q(e). With the values of oa 
from (5$.24) and with skew (1GD) = 2eo skew (万 D) 十 2skew ( 忆 2D),the equation 
(9.23) becomes 


几 a1 11el (73 or 12)eze3 
AQ2 一 2e0 72e2 十 2 (五 = 13)jesel ) (9.27) 
用 aQ3 73e3 (72 四 厂 )ele2 


Which, together with ed 十 ez 十 e2 十 e3 = 1, represent four quadratic equations for 
four unknowns. We Solve them very quickly by a few fxed-point iterations: update 
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Symmetric body non-Symmetric body 





force free body 





heavy body 




















Fig.S.3. Numerical solutions of the Tigid body equations; withoutwith gravitation， 
with/without symmetry. Initial values yio 三 0.2 ,ya2o 三 1.0,Vaso = 0.4; initial position 
of Qo determined by the quatenion eo = 0.4,el = 0.2 ,ez = 0.4,es = 0.8; moments 
of inertia 万 = 0.5, 7 = 0.85(0.5 in the symmetric case), 173 = 1; step Size 几 三 0.1， 
integration interval 0 < 二 区 30 


Successively ei from the ?th equation of (3.27) and then eo from the normaliza- 
tion condition. A Fortran Subroutine RATORI for this algorithm is available on the 
homepage <http:/www.unige.ch/~hairer>. 


Conservyation of Casimir and Hamiltonian. It is interesing to note that, in the ab- 
sence of a potential, the Algorithm 5$.1 preserves exactly the Casimir 邮 十 地 十 ? 
and, more Surprisingly, also the Hamiltonian 二 (08/ 厂 十 吧 / 有 十 好 /03). This can 
be seen as follows: without any potential we have skew (2) = Skew(2D) and 
Skew ( 玉 ) 一 一 skew(27D),sothatthe vectors (yio,yao,yao)7 and (il yal;yasl) 并 
are equal to 内 十 uv andv 一 v,respectively, where and v are the vectors of the right- 
hand side of ($.27). Since and w are orthogonal, we have | 十 z = | 一审 |， 
Which proves the conservation of the Casimir. 

To prove the conservation of the Hamiltonian，we first multiply the relation 
(5.27) with G = diag(1/Vv,1/V72,1L/V7s),andthen apply the same arguments. 
The vectors Gu and Cu are Still orthogonal. 








了 xample S.4 (Force-Free and Heavy Top). We present in Fig. 9.3 the numerical 
Solutions Vi obtained by the abpove algorithm. In the case of the heavy top, we asSume 
the centre of gravity to be (0,0,1) in the body frame, and assume that the third 
coordinate of the stationary frame is vertical. The potential energy due to gravity is 
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then given by V(Q@) = dss and, expressed by quaternions (5.26), itis 7 = ea 一 
2 
el 一 e2 十 63. 


(ID Splitting Methods 
As before we consider the differential edquation (5.21) for the angular momenta in 


the body V1,V2,V3a together with the differential equation (5.12) for the rotation 
matrix @. An obvious splitting in the presence of a potential is 


2jj2 9 天 9PJ12 (5.28) 


where PpU represents the exact flow of 


Q=(0， skew (Y) = 一 skew(Q7VU(G))， 


and G7 jls a Suitable numerical approximation of the System corresponding to Ki- 
netic energy only, ie., without any potential V7(Q). The use of splitting techniques 
for rigid body dynamics was proposed by TIouma 狠 Wisdom (1994)，McLach- 
lan (1993), Reich (1994), and Dullweber Leimkuhler & McLachlan (1997). Fasso 
(2003) presents a careful study and comparison of various ways of splitting the Ki- 
netic eneTgy. 


Computation of 57/ . We do this by splitting once again, by letting several moments 
of inertia tending to infinity (and the corresponding wi tending to zero). In order 
to avoid formal difticulties with infinite denominators，we write the System (3.10) 
together with (3.12) in the form 


21 0 -0% 纺 7 (9) 
办 |=|5 0 -一作 Too(y) (5.29) 
13 一 2 ”人 久 0 Ts (9) 
0 一 了 os () 22 () 
一 To 人 JTJ 0 


WhereT(y) = 去 (4/ 石 十 吧 / 十 好/7) is the kinetic energy, and Tv, (y) denote 
partial derivatives. 


Three Rotations Splitting. An obvious splitting of the kinetic energy is 
T( 轨 = 尼 ( 轨 + 否 ( 切 + 永 恕 ，， 尼 急 = 风 /(C20)， 0G.30 
which results in the numerical method 
于 一 1 和 op12o 21 op12o 01 为 ， 


where of is the exact flow of (5.29)-(5.30) with T(y) replaced by Ri(y). The flow 
0 二 ls easlly obtained: V1 remains constant and the Second and third equation in 
(9.29) boil down to the harmonic oscillator. We obtain 
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y 罗 = Saby0D， Q@O=Q(0)S(ob7 (5.32) 
with aw = Wi(0)/ and the rotation matrix 
工 0 0 
9(0)=|0 cos0 sinb0 
0 一 Sin0 cos0 


Similar Simple formulas are obtained for the exact fows corresponding to 已 > 
and 尺 3. 


Symmetric + Rotation Spjlitting. It is often advantageous, in particular for a nearly 
Symmetric body (71 之 72), to conslder the Splitting 


1 


zg=RO+sg，Rg = ( 工 - 工 ) 闪 ，sg= 人 机 ) 





and the corresponding numerical integrator 
代 尺 3 
2 二 2jP/2 9 92 六 9 2P/2， 


The exact fow oOR is the same as (5.32) with 厅 replaced by 厅 一 万 1 Thefow 
of the symmetric force-free top 2 possesses simple analytic formulas, too (see the 
first picture of Fig. 9.3): we observe a precession of y with constant Speed around a 
cone and a rotation of the body around w with constant Speed. Therefore 


的 = By0)， Q@OD=Q@(O)4GDB(p0 ， (5.33) 
where8 = (万 上 一 万 0)ya(0),and 


1 0 “一 %a(0) yz(0) cos0 Sin0 0 
4( 伯 = exp 芭 23(0) 0 ”一 吧 (0) ] ，B(0)=|1-sin0 cos0 0 
“”\-%(0) 00) 0 0 0 1 


This can also be checked directly by differentiation. 

Similar to the previous algorithm it is advantageous to use a representation of 
the appearing orthogonal matrices by duaternions. The correspondence between the 
orthogonal rotation matrices appearing in (5.32) and (5.33) and their quaternions ils， 
in accordance with Lemma 5$.3, the following: 


5S(0 7 人 cos(0/2) 二 isin(01/2) 
妃 (0) 全 cos(0/2) 十 sin(0O/2) 
4 全 cos(91/2) 二 arlsin(9/2)(z01(0) 十 Jo2(0) 十 Foys(0))， 








where ua = Vi(0)2 十 (0)2 十 %a(0)2 and 9 = at/ 厂 . The matrix multiplica- 
tions in the algorithm can therefore be done very efficiently. A Fortran Subroutine 
QUATER for the“symmetric + rotation Splitting”algorithm is available on the 
homepage <http:/www.unige.ch/~hairer>. 
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VJII.S.4 Lie-Poisson Systems 


In Sect.VII.5.1 we have Seen that the reduction of the equations of motion of the 
Tiglid body leads to the Poisson System (9.10) with a Structure matrix whose entries 
are linear functions. Here we conslder more general Polisson Systems 


= BO)V 万 (y)， (5.34) 


Where the structure matrix 妃 (y) depends linearly on V, i.e.， 
(办 =》 0 网。 for 训 1 (5.35) 
K=1 


Such Systems, called Lie-Poisson Systems, are closely related to differential equa- 
tions on the dual of Lie algebras; See Marsden 久 Ratiu (1999), Chaps. 13 and 14， 
for an in-depth discussion of this theory. 

Recall that a Lie algebra ls a vector Space with a bracket which is anti-symmetric 
and satisfies the Jacobi identity (Sect.TIV.0). Let 环 1,..., 玉 be abasis of a vector 
Space, and define a bracket by 


[本 = 》 及 (5.36) 
开 一 1 


with CN from (S.35). Hthe structure matrix 已 (V) of (5.35) is skew-symmetric and 
Satisfies (2.10), then this bracket makes the vector Space a Lie algebra (the verifica- 
tion ls left as an exercise). The coefficients CC are called strxwctwre co11sta11sg of the 
Lie algebra. Conversely, i 夺 we Start from a Lie algebra with bracket given by (5.30)， 
then the matrix 妃 (y) defined by ($.35) is the structure matrix of a Poisson bracket. 

LetgbeaLie algebra with abasis 已 1,..., 忆 nandlet gbethe dual ofthe Lie 
algebra, ji.e., the vector Space of all linear forms Y : g 一 疏 . The duality is written 
as ( 玉 X)foryr egandXeg. Wedenoteby 帮 ,zthedualbasis defined 
by ( 瑚 ,五 )) = 0 the Kronecker 0. 


Theorem S.S. ZLel g pe ad ZLie dlgep1a wii pasiy 已 1 卫 Satisji08 (5.30) 7D 
4 三 (op)T E 了 ”we associate 了 一 后 | V1T1 E ga1d we co1sider aq 
armilionian6 (y) = 万 (也 ). 

77je1m 加 e Poissom syste1i V 三 已 (IIV 万 (人 mwi 妃 (人 given py (5.35) 18 equiv- 
ale1t 1o 1je jollowizg dierential eqguatio1n o1z 1pe dual 9 


(YX)=( 冯 [BO XI Joral Xeg， (5.37) 





Where 五 (7) = > 六 1 2 五 让 


6 We use the same symbol 刀 for the functions 刀 :了 R" 一 及 and 万 :g 一 玉 . 
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Pioof Differentiating 瓦 (y) = 囊 ( 了 ) with respect to Wi gives 


一 ( 玉 , 末 (六 ) and 刀 '(Z) = > 2 


= 上 





了 





Here we have used the identification (g")* = g, because 万 '( 了 ) is actually an 
element of (g)*. With this formula for 万 (了 ) we are able to compnute 








竹本 一作 三 二 本 本)- 革 人 且 ) 


Where we have used ($.36). Since (Y， 五 ) 三 丰 and(7 玉 ) 三 this shows that 
the differential equation (3.37) is equivalent to 


包 亿 


(oo 


7 一 1 大 一 

















which is nothing more than 7 = 妃 (y)V 五 (y) with 已 (y) from (5.35). 
We remark that (3.37) can be reformulated as 


了 一 ad 责 (7 六 


where ad% is the adjoint of the operator ad 4(X) = [4, 民 |. 

Equation (3.37) 18 Similar in appearance to the Lie bracket equation 五 一 
[4(D), 如 = ad4(z)z of SectIV.3.2. When g is the Lie algebra of a matrix Lie 
group CQ, then solutions of that equation are of the form 了 = AdrdDZo where 


AdvX = UXU-1Li (5.38) 


See the proof of Lemma IV.3.4. Similarly, for the Solution of (3.37) we have the 
following. 


Theorem 5S.6， Co1zsidera71aa1rixZiegrotp GwWiiZLiealgepra g7TP1em 加 eyoutiom 
YE 和 oj.37 Wi 太 pitialyvalke YE 区 MgivenDy 


(了 (人 X) = 人 (TDTIXUN) Joral Xeg， (5.39) 
WApere U(b EGGsatisjies 
过 = 一 BY 7(0) = 工 (5.40) 
Equation (9$.39) can be written as 
工 坟 三 Adrto-iyo， 


where Ady-iis the adjoint of Adrv-:. The solution 闻 (tb of (5.37) thus lies on the 
CoadJoimat o7BpL 
YElflAdr- iecG (5.41) 


In coordinates 正 一 六 1 功 机 ,wenoteg 态 三 (0 太一 CO) 
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Prooj_ Differentiating the ansatz (3.39) for the Solution we obtain 
(了 YX) = ( 切 ,-UTIUUT-IXU 二 DTIXU) 
= ( 列 ,0 一 PDT] 功 = 作 区 VD)， 


Where we have used (3.39) in the first and the last equation. This shows that (3.37) 
is satisfied with the choice VU-L = 一刀 (2Z). 














上 xample S.7 (Rigid Body). The Lie group corresponding to the rigid body is 
SO(3) with the Lie algebra 50(3) of skew-symmetric 3 x 3 matrices, with the basis 


0 


0 0 0 
下 =-|00 -1|1， 画 =|00 
0 1 0 5 


0 1 0 -1 0 
0 0|1， 画 =|1 0 0 
0 0 0 0 0 
If weletz = (zl,zo2,zZ3)7 be the coordinates of X <e 50(3), then we have Xu 一 
ZXxuforalve 了 3.SinceforUV < SO(3)， 


Dr-IXU = TUTIzxUo=UTizx 


the vector U-1Z consists of the coordinates of -1IXU E5o0(3). 
Lety = (Wi,y2,y3) 三 be the coordinates of Y Ee 50(3)* with respect to the dual 
basis of 已 1, 已, 尼 3. Since 


3 
( 郊 I 一 XU) = 3 2j1T7， > (Co) 矶 ) = 多 UTz = (Oo， 


了 一 1 1 一 1 


the coordinates of Adrv-iy are given by the vector LV/V. Therefore, the coordinates 
of the coadjoint orbit of Y lie on a sphere of radius ||yl. The conservation of the 
coadjoint orbit thus reduces here to the preservation of the Casimir C(y) = ?2 十 


友 十 邓 . 


Lie-Poisson integrators Seem to be have first been considered by Ge 儿 Marsden 
(1988), who extend the construction of symplectic methods by generating functions 
to Lie-Poisson Systems. Channel & Scovel (1991) propose an implementation of 
these methods based on a coordinatization of the group by the exponential map. 

McLachlan (1993) proposes integrators based on splitting the Hamiltonian and 
jllustrates this approach for various examples of Lie-Poisson Systems. When ap- 
plicable, such splitting integrators yield Poisson integrators that preserve the coad- 
joint orbits, Since they are composed of exact Hows of Lie-Poisson Systems. 

Engy 久 Faltinsen (2001) propose to Solve numerically the Lie-Poisson System 
(3.34) by applying Lie group integrators Such as those of Sect.IV.8 to the differential 
equation (3$.40) with (3.39). This approach keeps the Solution on the coadjoint orbit 
by construction, but it does not, in general, give a Poisson integrator. 
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VII.S.S Lie-Poisson Reduction 


The reduction of the Hamiltonian equations of motion of the free riglid body to 
the Euler equations is an instance of a general reduction process from Hamiltonian 
Systems with Symmetry on a Lie group to Lie-Poisson Systems，which we now 
describe; cf. Marsden 贸 Ratiu (1999), Chap. 13, for a presentation in a more abstract 
framework and for an historical account. 

Let us assume that the Lie group C is a subgroup of GL(z) given by 


and consider a Hamiltonian System on C@， 


P = -voHP9)-> ANXvog(Q)， @=vpHF(PO) 
记 1 (9.43) 


0 二 0i(@)， 1 一 1 上 .77 


Where 书 C are square matrices, and Vo 互 三 (6 万 / 0Q5) . This ls of the form 
discussed in Sect. VII.1.2. In reglons where the matriX 


2(PO) 
(请 


777 


(ve 0i(Q)，Vogi 9))) is invertible， (5.44) 


7 一 1 


the Lagrange parameters Xi can be expressed in terms of 书 and CQ (cf. formula 
(1.13)). Hence, a unique solution exists locally provided the initial values (已 ,Qo0) 
are consistent ie., 9i(@o) = 0 and 


g(Qoj(vpECUP,Qo) = tace(vogi(QorYpE(P Go) =0， 


or equivalently Co e G and VP(P,Qoj se To。G. 

We now assume that the Hamiltonian 妃 is quadratic in 书 . As we have seen in 
Sect.VIL1.2, the equations (3.43) can be viewed as a differential equation on the 
cotangent bundleTCG={PQO) QQEeG PEeTGcj where the cotangent space 
7OC is identified with a subspace of matrices Such that 


忆 e7ToG 直 and only 下 Vp 万 (已 Q@) ETeG. (5.45) 


With this identification, the duality beween TOG and TO@CG is given by the matrix 
inner product 


(已 =trace(PIV) for Pe 7oc, 了 ET7TecG. 
We call the Hamiltonian 1e 万 -zyva7zia7t, 让 


厅 (07PUrIG@)=TPQ) forales G. (5.46) 
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In this case we have 万 (PQ) = 万 (QT 站 and by differentiating we obtain 
Vp 万 (已 Q) = QVPp 末 (QQ 站 By (5$.45) and since ToG = {QX;IX eg 
with the Lie algebrag 三 T7CG (cf Sect.IV.6), this relation implies 


Pe7TzG ifandonyif QT7Pe 他 G= 9 (5.47) 


Now 万 (PPQ) depends, for (已 Q) < T*G, only on the product 丰 = QIPE g， 
and we write 万 (PQ@) = 万 (7) with afunction 万 : 旦 一 及. 

Left-invariant Hamiltonian Systems can be reduced to a Lie-Poisson System of 
halfthe dimension by a process that generalizes the derivation of the Euler equations 
for the rigid body. 


Theorem S.8， Co1sider Ga 已 azailfomia1z 8Syste11 (3.43) o1 Q 1IQ11i Lie 81oUD G 
Wiip a 1ej-ipyariant guadratic Famailonian 万 (已 Q) = 万 (站 jpor7 = QIP. 太 
(PNQO) ETYGiasolution ofthe sysfema(5.4 了 1peny 轴 =QOTPOEe 和 对 
Solves 1pe dijjerenlial eguatio1 (5.3/) 


Prooj It is convenient for the proof (though not necesSary， See the lines follow- 
ing (2.17)) to extend the Hamiltonian 媚 : 入 一 了 toatfunction of arbi- 
trary matrices 了 by setting 万 (了 7) = 互 (TY)， where 7 is the projection onto 
ggiven by (三 及 X) = (也 X) foralX E g with the matrix inner product 
( 冯 X) 一 trace(Y7X). 

We first compute the derivatives of 万 (已 Q) = 万 (了 7) = 万 (TY) = 万 ( 切 
where QI P = YY and, using the notation of Theorem 5.35,， IT7Y == 王 和 5 她 玉 
Since yj = (ITQ7P 万 ) = (QTP 人 it follows from VAatrace (47 碌 ) = 已 that 


Q 








万 
Vp 万 (PQ) = 2 节 Vptrace(PTQ 万 ) = Q 克 (2 切 )， 
| | 
(5.48) 
Where 万 '( 了 ) = 滞 本 一 E gasin Theorem 5.3. Using the identity yj 一 
trace (P7Q) = trace (QT PP7 ) we get in a similar way 
Vo 末 (PQ) = 已 厅 ( 人 (并 (3.49) 
Consequently, the differential equations (53.43) become 
已 = 一 PEF(QIP)7 谤 Ni Vooi(Q Q = QH(O7P).，， (5.50) 
The product rule 立 一 QT 十 QT7 忆 for 关 = Q7LPthus yields 
站 = 丽 (77 开 -》 NGrvooi(Q). (5.51) 


一 工 


7 We use again the same letter for different functions. Since they have either one or two 
arguments, no confusion Should arise. 
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FEor X E g, we now exploit the properties 
(9@Tvogi(@),X) = 〈Vvogi(@),QX) =0 ecauseQX EToG) 


《[ 丽 (区 ) 罗 玖 ,X)》 = trace((Y7 (7Y) 一 吾 (7)YT)X) 
trace (Y7 ( 丽 ( 了 ) 和 一 六 末 (Y))) = (到 [本 (2 司 )》 





SinceY( 人 (Eggforalltthis gives the equation (3.37). 











Reduced System and Reconstruction. Combining Theorems 9.8 and 5.3, we have 
reduced the Hamiltonian system (5.43) to the Lie-Poisson system for y(b E 下 1， 


三 已 (OO)V 万 (人 )， (5.52) 


of half the dimension. To recover the full solution ( 尸 (人 ,CQ 的) E 7TG we must 
Solve this System along with 


4=9F)，P=Q9Y (5.53) 
whereY = 浊 : 本 717 E g 


Poisson Structures。 The Poisson bracket on 玉 4 defined by 已 (y) is still closely 
related to the canonical Poisson bracket on 及 2" . Consider left-invariant real-valued 
functions 天, 二 on 7T”*G. These can be viewed as functions on 7T7G/G = 入 C 
术 ?x<m 

K(PQ)=KY) for 闻 =Q7P. 


(As we did previously in this section, we use the Same Symbol for these functions.) 
Via the projection 三 : 及 "一 多 used in the above proof，we can extend 
开 (7) = 开 (CTY)toarbitrary7x7mmatrices 了 ,and via the above relation to a 
left-invariant function 天 (已 @) on 陈 " xm x 到" on Which we have the canonical 
Poisson bracket 





天 ,也 on = 
{ ) ] (5 DOP. DOPu | 


On the other hand, we can view 及 as a function on 了 及 4 by choosing coordinates 
on g”， 
开 ( 几 = 天) for 下 = 》 芒 万 Eg 
7 一 1 


On 到 4 we have the Poisson bracket defined by the structure matrix 巨 ()， 


o 
{, = 沪 1 岂 下 


有 


292 VIL. Non-Canonical Hamiltonian Systems 


Lemma 5$.9. LO Leji-IvG7ia12t Jactio1ns 开 , 了 ay descriped apove，wWe jave Jor 
QILP = 并 = 二 107117 Eg 


{ 开 丰 (O = 区 (天 (CD)]) ={ 人 ,70PQ) 
Where 天 () 一 3 1 可 用 (7/ )Pi EUg. 
Proof The first equajity follows from the identity 
2ij(9) 三 《 殉 [ 尼 )， 万 j]) ， 
which is a direct consequence of the definition (3.39$) with (3.36). For the Second 
equality, the relations (9$.48) and (3.49) for 人 and 志 yield 
{ 开 eana( 已 Q) = trace (天 (了 7 天 (2 一 天 (77( 攻 ) 妆 
= trace (天 (了 了 (一 天 (2Y KZ)) 
= tace ( ZTE, 瑟 ()]) =( 忒 区 (2 K(])， 


人 
人 





which is the result. 











Discrete Lie-Poisson Reduction. Consider a Symplectic integrator 
(Pi = 到 (Po ,GOo) on TY*CG 


for the left-invariant Hamiltonian System (3.43), and Suppose that the method pre- 
Serves the left-invariance: 证 (Po,Qo) = (已 ,Qi,then 


Gin(U07PiU-IQ@o) = (0 PUTIQ@U foral Te G. (5.54) 


For example, this is Satisfied by the RATTLE algorithm. The method then induces a 
one-step map 

1 一 奈 (Yo) On 
by setting 交 = Qf PP for (PQ = 于 (PPQo with Qi = 巧 .Thisis a 
numerical integrator for ($.37), and in the coordinates y 三 (VJ) with respect to the 
basis ( 芒 ) of g” this gives a map 


邦 王 如 (yo) on 形 "， 
which is a numerical integrator for the Poisson System (3.32). 


了 xample $.10， For the rigid body，applying the RATTLE algorithm to the con- 
Strained Hamiltonian System (3.18) yields the integrator for the Euler equations 
discussed in Sect.VJIL.3.3. By the following result this is a Poisson integrator. 


Theorem S.11. 矿 G1 (已 Q@) masyaplectic and 1ef-iavariant integratorjor(5.43) 
ipem it yeduction (yy) 1 a Poissom 12aD. 
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Pioof We write VWhn as the composition 


几 : 有 TeG 2 T*G -有 4 





wherem = (7) is the function with ij(PQ) =WftorQP= 三 9 V1Ph andE 
is any Tight inverse of m, i.e., 7 o = id. For arbitrary smooth real-valued functions 
天, 二 on 了 4 wethenhavefor (已 @) = 和 (using Lemma 5.9 in the outer equalities 
and the Symplecticity of On in the middle equality， 


{ 攻 oowp = {Ko7ogho7oG1hpjcan( 忆 OO) 
{ 开 & 1 也 吕 1jcan(@o(P,GQ)) 二 {， 卫 } (人 ). 


This equation states that Wn ls a Poisson map. 














A similar reduction in a discrete Lagrangian framework is studied by Marsden， 
Pekarsky 儿 Shkoller (1999). 

The reduced numerical maps wh and have further structure-preserving prop- 
erties: they preserve the Casimirs and the co-adjoint orbits. This will be Shown in 
Sect.IX.3.3 with the help of backward error analysis. 


VIIL.6 Reduced Models of Quantum Dynamics 


To incorporate quantum effects in molecular dynamics Simulations，computations 
are done with models that are intermediate between classical molecular dynam- 
ics based on Newton' s equations of motion and full quantum dynamics described 
by the NW-particle Schrodinger equation. The direct computational treatment of the 
latter ls not feasible because of its high dimensionality. These intermediate mod- 
els are obtained by the Hamiltonian reduction (2.17) from an infnite-dimensional 
Hilbert Space to an appropriately chosen manifold. In chemical physics, this re- 
duction is known as the Dirac-EF7remkKel 1711e-depe1ident VG1iIGLio1GL PHiCIDLe. We 
ilustrate this procedure for the case where the quantum-mechanical wave function 
is approximated by a complex Gaussian as proposed by Heller (1975). It turns out 
that the resulting ordinary differential equations have a Poisson Structure，which 
WwWas Tecently described by Faou 儿 Lubich (2004). Following that paper， we derive 
a structure-preserving explicit integrator for Gaussian wavepackets, which tends to 
the Stormer-Verlet method in the classical limit. 


VJII.6.1 Hamiltonian Structure of the Schrodinger 了 Equation 


The introduction of wave mechanics stands .… as Schrodingers monu- 
Iment and a worth one. (From Schrodinger's obituary 
in 7jpe 7imres 1961; quoted from http:W/www-groups.dcs.Sstrand.ac.uk/ his- 
tory/Mathematicians/Schrodingerhtml) 
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The time-dependent V-body Schrodinger equation Teads (See, e.g., Messiah (1999) 
and TIhaller (2000)) 
OUY 


ie 开 一 瑟 % (6.1) 


for the wave function 凤 = W(z, 加 depending on the spatial variables Z = 
(Z1)...,ZN) with ZK E 及 4 (e.g., withw = 1or3inthe partition) andthe tme 
t E 下. The squared absolute value |W(z, 妨 |”represents the joint probability density 
for N particles to be at the positions Z1,... ,ZNattmetIn(6.l),sisa(Smal) pos- 
itive number representing the Scaled Planck constant and ; ls the complex Imaginary 
unit. The Hamiltonian operator 束 1 written 


媚 王 不 十 信 


with the kinetic and potential energy operators 


N 5 
E 
六 = 一 》 一 一 4。 d 人 太 =T(z)， 
2 2771 忆 


Where 7 > 0is aparticle mass and Az，the Laplacian in the variable ZK E 及 7， 
and where the real-valued potential Y acts as a multiplication operator (Vg)(Z) 三 
VY(z)b%(z). Under appropriate conditions on 了 (boundedness of is sufficient， 
but by no means neceSSary)，the operator 囊 is then a self-adjoint operator on 
the complex Hilbert space 72(Rv,C) with domain 万 (万 ) = DT) = {0 < 
72( 了 RN CiToOEe 2( 了 RN,C)}isee SectV5.3 of Kato (1980). 

We separate the real and imaginary parts of 罗 = 二 iu E (了 RN Cj, the 
complex Hilbert Space of Lebesgue square-integrable functions. The functions v and 
WwW are thus functions in the 7real Hilbert space 厂 ?( 了 4 ,及 ). We denote the complex 
inner product by (.,.》and the real inner product by (.,.). The 72 norms will be 
simply denoted by || .||. 

As 万 is areal operator formula (6.1) can be written 


5 三 万 


< 二 一 末 w， 人 


or equivalently, with the canonical Structure matriX 


0 一 | 
and the Hamiltonian function (we use the Same Symbol 妃 as for the operator) 


1 半 工 
五 (wo) 一 本 ( 仿 如 录 5(2, 五 JW 让 本 (Ww, 五 U) 
for 允 = 十 ?0 in the domain of the operator 妃 . This becomes the canonical 
Hamiltonian System 
“ 】 = ce-IJIV 玉 (oa 


?0 
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Note that the real multiplication with . corresponds to the complex multiplication 
with the imaginary unit z. The How ofthis System preserves the canonical symplectic 
two-form 


wa 人) = (Vi 6)， ea e 有 (了 人 ,及 )2. (0.3) 


VII.6.2 The Dirac-Frenkel Variational Principle 


For dealing with atoms involving many electrons the accurate quantum 
theory, involving a solution of the wave equation in many-dimensional 
Space, is far too complicated to be practicable. One must therefore resort 
to approximate methods. (PA.M. Dirac 1930) 


Reduced models of the Schrodinger equation are obtained by restricting the equation 
to an approximation manifold 人 4 via (2.17), Viz.， 


(s. 几 一 V 瑟 =0 foral 和 evA1， (6.4) 
or equivalently in complex notation forv 一 (wz) 一 了 十 io， 
Re(s 记 一 下 E)=0 forall Se 人 4. (6.5) 


Taking the real part can be omitted if the tangent space TvA4 is complex linear, 
Equation (6.3) (usually without the real parb ls known as the Dirac-Frenkel time- 
dependent variational principle,， after Dirac (1930) and Frenkel (1934); see also 
MecLachlan (1964), Heller (1976), Beck, Jackle，Worth 多 Meyer (2000), and ref- 
erences therein. 

We choose a (local) coordinate map = X(V) of A4 and denote its derivative 


Xcl(y) =VO) + =X(y) orinthe real setting as X 王 ( 和 . Denoting 


厂 
by X7 the adjoint of X with respect to the real inner product (., ), we thus obtain 


sX()7 TITX(O)I = X(O) Yu(x(y)， 


With XK denoting the adjoint of Xc with respect to the complex inner product (.，》， 
Wenote XEXc = (VT7 +T 有 )TiVT 权 一 TY)=XTX 一 XIJXand 
hence 

X7JX = -ImXzEXc. (6.0) 
Lemma 6.1. 矿 T AM NS Ga complex Lear Space jor evemyu E 人 1jen ANa 
Sy11Djectic SUBD1aa1ijo1d of 72( 阴 N ,要 )2，1Pat ziPe symaplectic jpo- 加 171a (6.3) 让 
1O1-dege1lerale oO Tu Aporall E A 人 1 人 .Expressed z1I coo1di1zatey， 


X()7TX(ON ivertibple Joral1 yy 
Poof We fx = X(J) E Af and omit the argument y in the following. Since 


TuAM = Range(Xc)j is complex linear by assumption, there exists a real linear 
mapping 二 :下 史 一 下 交 SuchthatzXc7 三 XcZmforal7 e 开 7.This implies 


JX=XTI and 天 = 一 Id 
and hence X7T.JX = X7TXT, which is invertible. 
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Approximation properties of the Dirac-Frenkel variational Principle can be ob- 
tained from the interpretation as the ortpogoal! projection 也 = 书 | (让 ) 去 刀 上 Which 
corresponds to taking the imaginary part in (6.3), as opposed to the Symplectic pro- 
jection in (6.4) which corresponds to the real part. See Lubich (2005) for a near- 
optimality resujlt for approximation on the manifold. 


VJI.6.3 Gaussian Yavepacket Dynamics 


We develop anew approach to semiclassical dynamics which exploits the 
fact that extended wavefunctions for heavy particles (or particles in har- 
Imonic potentials) may be decomposed into time-dependent wave pack- 
ets, which spread minimally and which execute classical or nearly classi- 
cal trajectories. A_ Gaussian form for the wave packets is assumed and 
equations of motion are derived for the parameters characterizing the 
Gaussian. (E 上 .J. Heller 19759) 


The variational Gaussian wavepacket dynamics of Heller (1976) is obtained by 
choosing the manifold 人 4 in (6.5) as consisting of complex Gaussians. For ease 
of presentation we restrict our attention in the following to the one-particle case 
NV =1(theextensionto N > lisstraightforward; cf. Heller (1976) and Faou 多 
Lubich (2004)). Here we have 


AMM=f{v=X)e 2(R4C):y= (pda 0EeR2 withO>0} 
(6.7) 
with 


(xD)) (z) 二 exp(2((a + 字 )lz 一 o 二 DZz 一 人 十 Y 十 刘 )， (6.8) 


where | . | and .stand for the Euclidean norm and inner product on 玉 4. The pa- 
rameters qd and P Tepresent the average position and momentum, respectively: for 
凡 一 X(y) withy = (2;,9q,a;,0,7y,6) and lv = 1 adirect calculation shows that 


=(wozo= 上 人 Zu(z)|2 dz ， 了 三 (2 一 人 EVL)》， 
及 < 


The parameter 0 > 0 determines the width of the wavepacket, The tangent Space 
Tu C (了 4C) atagiven pointv = X(J) < Ais (2d 十 和 -dimensional and is 
made of the elements of 二 2( 及 4,C) written as 


(4+ 训 B)p- 时 +(P-2a+ 洞 9- 四 -9+C+iD)w (69) 


with arbitrary (已 @, 4,B,C,D)7 ee 了 24+4. We note that TuAM is complex linear， 
anduw E TABychoosingE= 记 ipn(6.3),this yields (d/dbluwl2=2Re(awy) 一 
0 and hence the preservation of the squared 志 2 norm of v = X(y),， which is given 
by 
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3 

人 

SS 
| 


加 站 一 上 |w(z)|? QZ (6.10) 


2 26\ TeN2 
定 人 sp(C2ee-oP+)w=em(-) ( 纪 ) 


The physically reasonable situation is ||uw||2 = 1, which corresponds to the interpre- 
tation of |w(zZ)|> as a probability density. 
With these preparations we have the following result of Faou 儿 Lubich (2004). 


Theorem 6.2. 77Pe G11azzltomia1 7eduction ofipe Sc1prodijzgereguation 1o te GaUSSiG1 
WGVEeDPaCKet 1IG111Q 人 of6.7)-(6.8) yields 1je Poisso1z sys1e11 


= BO)VK(Uy) (6.11) 


WwWjere, jporgy = (pda 6) E 了 2444 wiiD > 0 and mw 就 1o denoting the 
Q-di11ze11yio11al ide12tity 


0 二 
1 0 0 0 0 0 
| 4 0 8 
四 5 
| 人 0 
D7 0 0 -4 0 呵 2c 
0 0 0 0 -=- 宇 e 0 
dejipesy a Poissom Structu1e, Ga11d Jor 凡 = X(y)， 
开 () 三 (wo 瑟 Us 三 ET 十 玫 v(Oy) (6.13) 


18 1e 1otal ee18gy wii letic da11d Pote1tial PaH1 





碳 全 =J 人 (中 + 癌 二 交 ) = 他 7 
CQG11G 
Kv(y) 三 昌 (zjexp ( - =(lz 一 中 十 可 ) dz 一 (Yu). 


Bo 万 开 (V) and TV) ae Jst integ1als oftpe syste11. 


Proof Asin (2.22), the differential equation for yis seX()7.JX(N7 = 3V 天 ( 
We note (0.60) and 





2 
Xc(Oy) 全 E (z d， 2a(z 9) 也 ， |z qd ， ilz 一 q， 二 5 让 人 


wherew = w+ 二 20andv = X(V) in the complex setting. Using the calculus of 
GausSsian integrals, we compute 
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0 la 0 0 0 0 
-lo 0 0 加 0 衬 
1 0 0 0 2 0 - 艺 
< (MAO=5rJ) | 0 二 0 本 
0 0 2 
0 ， 二 2 涉 0 2 0 


and inversion yields the differential equation with B(y) = (2=X7(J)JX(J)-1as 
Stated. The System is a Poisson System by Theorem 2.8. 














Assuming 7T(y) = lu = 1 we observe that the differential equations for the 
average position and momentum, g and p, read 


4=D/mm， = 一 人 VYV) (0.14) 


forv = X() andy = (pqa;D,7,0). We then note (VVV)》 一 VV(q) as 
E 一 0. The differential equations for dg and p thus tend to Newtonian equations of 
motion in the classical lmit s 一 0: 


4 三 Dj/ ， 六 = 一 VY(9)， (6.15) 


It will be useful to conslider also scaled variables 
全 人 今 人 0 
作 = (as05E<R2t with 有 = 人 0 = 二 (6.10) 


Here we have ， 国 攻 
攻 = BO 人 V 开 ( 力 (6.17) 


Where the Structure matrix 万 ( 力 is independent of s, and where 天 ( 妨 depends reg- 
ularly on s > 0. 


VIIL.6.4 A Splitting Integrator for Gaussian VWavepackets 


With the natural Splitting 妃 三 下 十 六 into kinetic and potential energy，wWe nowW 
consider the variational Splitting integrator (4.7) - (4.8)，which here becomes the 
following. 


1 We define \ 夺 记 AM as the solution at time 用 /2 of the equation for 
(1E 一 YuE)=0 foral ceT7TA4 (6.18) 


with initial value w(0) = ww E 人 4. 
2.， We define wii as the solution at tme 万 of 


(1 一 Tc)=0 foral ee 人 (6.19) 


with initial value w(0) 一 三. 
3. Then wn+i is the solution at time 刀 /2 of (6.18) with initial value w(0) 一 2 1 
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By Theorem 6.2, the Substeps in the definition of this Splitting method written in the 
coordinates y = (p,dq,a,D,7y,0) are the exact fows Pa and 0 of the Poisson 
Systems 

0 二 DBUJVYKvO) and 2= BOJ)VK7( 


Note that both equations preserve the 72 norm of v = X()， which we assume to 
be 1 in the following. 

Most remarkably, these equations can be Solved explicitly. Let us consider first 
the equations (6.19). They are written, fora =aw 十 10 andc 三 了 十 0, as 


4 = Dp/m， 4 二 一 202/m， 
(6.20) 
旋 = 0， 一， (让 2|2 + iceda)/rm， 


with initial values yo = (po, qo, ao, co) corresponding to uo = X(yo). They have the 
Solution 


Q 
| 


CQ0 


世 
t) 三 00 十 一 t) 一 1 三 一 一 一 
Q( ) d0 友 20， ( ) 20， QL ) 1 2aot/ ) 


and 


|2ol? 1EQ 2aot 
如 三 夺 二 一 十 一 1] 1 二 一 一 |】 
9 277 了 玉 本 770 


Let us now consider the equations (6.18). Iaking into account the fact that the po- 
tential Y is real, these equations are Written 


六 三 一 (如 VVV》 4=0， 
G = 一 克 ( 几 AVvw)， 有 = 0， (6.21) 
了 = 一 Yu) 十 萝 人 ATYv)， 0 = 0， 


with the 了 2 inner products 


(pe 一 gl 十 5)) 凡 (6.22) 


(2 TV》 = 人 全 (z) exp ( - 


for 到 = 及 VAY. Asthe 7 inner products in the equations for p, a, 7 depend 
only on q, 0, 0 which are constant along this trajectory, these equations can be Solved 
trivially, requiring only the compnutation of the inner products at the initial value. We 
thus see that the Splitting Scheme 2 三 102 O Oo Ka can be computed explicitly. 
This gives the following algorithm (Faou 久 Lubich 2004). 


Algorithm 0.3 (Gaussian Wavepacket Integrator). 4 step Joja 111e 如 如 如 + 
SIG11D128 Jo11t 态 e GaUSSia1z Wayepacket un 一 X(Dny anyany DTny bn) proceeds ay 
JoO1iowsy: 


7 硫 太 ( 乓 ) 三 (un ) genpby(6.22)1or 下 三 凤 V 太 AT comptute 
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玫 





Dmn+l/2 二 Don 一 了 (VT)m 
汪 几 
am 二 am 一 而 (AP)n (6.23) 
几 E 
”， 一 人 十 一 (人 TV 
加 十 天 刺 (AV) 


2.，7Z7oma 1pe values Dn+1/2， 0 叶 三 ao 才 二 and ct = 全 十 20 co1Ppute qh 
aaanhHi 十 20nHD and cn 三 271 十 onH1Via 











妃 
dan+l 三 和 un 十 一 Dn+l/2 
27 
二 吐 / (1+ 一 地) (6.24) 
7 
Ed 2 几 
ec = 地: 加 log(1 | 虹 
纪 
了 COPDIie pn+l;,Qn+ly 7nTIomz 
妃 
Dnp+l 二 PPn+l/2 一 了 (VV)n+l 
有 已 
CQm 二 L 一 CQmP 二 1 二 本 (An+i (0.29) 
己 玉 E 
?pr+1 二 ?pn+l 十 165 而 (ATY)nm+1. 


Let us collect properties of this algorithm . 


Theorem 6.4. 77Pe SPUitig Scjne1ie of 41go1itp11 6.3 1 Ga1 exD1icihi Sy1I111et1ic， 
SeCO11d-o1der 11U11e1ical 11etpod jpor Gaxssyia1 Wayvepackel dy1ia11icsy (0.77) 六 (0.73). 
118Q Poisso1m Iteg1ator jor 1je St1rUct1U1e 1IG1FX (0.72) Ga1d iD1eservesy 加 e 21 
也” norma of ipe wayepackets: un| = 1Jor am 

1 11e LE 一 0 1je positio1l G11d 1IO11E11U1L GDD1OXIN1G1IO1S qn，Dn OF 1118 
11e1jod 1e1d 1o 加 ose opiai1ed py apP1ying te 91O7F11e/ 一 Werietl 11etpod io 1je asso- 
Ciated classical 11ecja1zical yySte111 (0.79). 


The statement for s 一 0 follows directly from the equations for pn+17/2，Qn 十 1， 
pn+il and from noting (VV)， 一 VV(an). 

In view of the small parameter s, the discussion of the order of the method 
Tequires more care. Here it is useful to conslider the integrator in the Scaled variables 
了 = (pq,a;,G/sy,6/s) of (6.16). Since the differential equation (6.17) contains = 
only as a regular perturbation parameter, after 7 Steps of the Splitting integrator we 
have the s-uniform error bound 


轴 一 大) = OU )， 


where the constants Symbolized by the O-notation are independent of s and ofm and 
丸 with 7 六 入 Const. For the approximation of the absolute values of the Gaussian 
wavepackets this yields 
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je 一 etoP= Oo)， (6.26) 
but the approximation of the phases is only Such that 
la 一 we 名) 川 = O(022/e)， (6.27) 


We refer to Faou 儿 Lubich (2004) for the formulation of the corresponding algo- 
rithm for N > 1 particles, for further properties Such as the exact conservation 
of linear and angular momentum and the long-time near-conservation of the total 
energy (nn 五 wm and for numerical experiments. 


VII.7 了 kxXercises 


1.， Prove that the Poisson bracket (2.8) Satisfies the Jacobi identity (2.4) for all 
functions 忆 G, 瓦 ,让 and only ifit satisfies (2.4) for the coordinate functions 
UVI VE 
Et (FE Engel in Lie's Cesa1z111elle 4D1. vol.3, p.753). Ifthe Jacobi identity ils 
written as in (3.3), we See that there are no Second partial derivatives of 玉 (the 
left hand Side is a Lie bracket, the right-hand Slide has no Second derivatives of 
有 anyway). Other permutations Show the Same result for C and 克 . 

2. For z in an open subset of 及 mw, let 4(z) = (aij(z)) be an invertible skew- 
Symmetric 7 x 70-matrixX, with 


Daij Oaji Oa 久 
OZ 丈 Oz7 二 Di 





=0 forall ?7 大. (7.D 


(a) Show that B(z) = 4(z)-1 satisfies (2.10) and hence defines a Poisson 
bracket. 
(b) Generalize Theorem 2.8 to Hamiltonian equations (2.18) with the two-form 
wa(ebe) = er4(o)e. 
Re1aa1K Condition (7.1) Says that w ls a closed differential form. 

3. Solve the following first order partial differential equation: 





Resul 21 一 3y2,5yo2 十 273). 
4. Find two solutions of the homogeneous System 
DOP OP DF OP DOP OP DF 


3 二 2 避 2E0 泡 3 
0 0wm Dys Dy4 om Dow wa 








Such that their gradients are linearly Independent. 

5. Consider a Poisson System 7 = 已 (人 V 感 (V) and a change of coordinates 
2 一 V(yW). Prove that in the new coordinates the system is of the form 
二 (z)JY 开 (where 感 (z) = 丸 ( 轨 B( 人 YY(W)T (cf formula (3.12)) and 
玫 (z) = 玖 ( 幼 
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10. 


11. 
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Give an elementary proof of Theorem 4.3. 

Pi Define 0) := eof)B(eof)7 一 瑟 (pr(g)). Using the variational 
equation for (4.1) prove that 6(t) is the solution of a homogeneous linear dif- 
ferential equation. Therefore, 6(0) = 0 implies 6( 坟 =0forallt 


. Letz = V(V) be atransformation taking the Poisson system 7 三 马 (V)YV 厂 (y) 


to 之 二 互 (2JVK(z). Prove that @ (四 is a Poisson integrator for B(y) 让 and 
only 让 柬 (z) 一 0o gj oor1(z) is aPoisson integrator for 妃 (z). 


. Let 已 be a skew-Symmetric but otherwise arbitrary constant matrix, and con- 


Sider the Poisson system 7 = 已 V 万 ( 妇 . Prove that every Symplectic Runge-- 
Kutta method is a Poisson integrator for Such a System. 
DA# Transform 已 to block-diagonal form. 


. (M.J. Gander 1994). Consider the Lotka-Volterra equation (2.13) with Separa- 


ble Hamiltonian 妃 (wu) 三 天 (u) 十 也 (u). Prove that 
mn 十 1 二 2m 十 PunuUn 五 (un Un )， Unm+L 二 UVmn 一 jnIUn 厂 (un um) 


ls a Poisson integrator for this System 

Find a change of coordinates that transforms the Lotka-Volterra System (2.14) 
into a Hamiltonian System (in canonical form). Following the approach of Ex- 
ample 4.11 construct Poisson integrators for this System. 

Prove that the matrix 巨 (y) of Example 2.7 defines a Poisson bracket, by show- 
ing that the bracket is given as Dirac's bracket (Dirac 1950) 


{ 有 G = 人 6 并 EcoifcnG (7.2) 


Here 严 and C are functions of y， 素 and Gare smooth functions ofz Satisfying 
F(X() = 下) and G(X()) = GO，ci(z) are the constraint functions 
defining the manifold A4, and ?57 are the entries of the inverse of the matrix 
({ci; cj The Poisson bracket to the left in (7.2) corresponds to 已 (y) and 
those to the right are the canonical brackets evaluated at z = X( 妇 .Replacing 
玉 (z) by 天 (z) 十 并 Mk(zjek(z) with mx(z) such that { 玉 ck = 0 on 人 4 
eliminates the sum in (7.2) and proves the Jacobi identity for (y). 


Chapter VJIIU. 
Structure-Preserving Implementation 


This chapter ls devoted to practical aspects of an Implementation of geometric inte- 
grators. We explain Strategies for changing the Step Size which do not deteriorate the 
Correct qualitative behaviour of the solution. We study multiple time stepping strate- 
gies, the effect of round-offt in long-time integrations, and the efficient solution of 
nonlinear Systems arising in implicit integration Schemes. 


VIIU.1 Dangers of Using Standard Step Size Control 


Another possible shortcoming of the method concerns its behavior when 
used with a variable step Size .. . The integrator completely loses its desir- 
able qualities ... This can be understood at least quajlitatively by realizing 
that by changing the time step one is in essence continually changing the 
nearby Hamiltonian . . . (B. Gladman, M. Duncan 多 J. Candy 1991) 


In the previous chapters we have studied symmetric and Symplectic integrators, and 
we have Seen an enormous progress in long-time integrations of various problems. 
Decades ago, a Similar enormous progress was the introduction of algorithms with 
automatic Step Size control. Naively, one would expect that the blind combination 
of both techniques leads to even better performances. We Shall see by a numerical 
experiment that this 1s not the case, a phenomenon observed by Gladman, Duncan 
儿 Candy (1991) and Calvo 儿 Sanz-Serna (1992). 

We Study the long-time behaviour of Symplectic methods combined with the 
following standard Step Size Selection Strategy (See e.g., Hairer Ngrsett 儿 Wanner 
(1993)， 9ect. JI.4). We assume that an expresSslion en related to the local error 18 
available for the current step computed with step size 几 (usually obtained with an 
embedded method). Based on an asymptotic formula errm 久 Cj (for An 一 0)and 
on the requirement to get an error close to a user Supplied tolerance 7D/,we predict 
anew step Size by 





7o1 


eEFrm 


] = 0.85 出 (1 


Where a Safety factor 0.85 is included. We then apply the method with step Size 
An+l 三 Prew. 玉 for the new step errn+fl 挟 7D1, the step is accepted and the 
integration ls continued. If errn+l > To itils rejected and recomputed with the Step 
Size jnew obtained from (1.1) with 允 十 1instead of7m.Similar step Size strategies 
are Implemented in most codes for solving ordinary differential equations. 
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exact Solution 
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Fig. 1.1. Stormer-Verlet Scheme applied with fxed step Size (middle) or with the standard 
Step Size strategy (below) compared to the exact solution (above); solutions are for the interval 
0 和 上 <120(efb,for 2000 和 上 和 2120 (middle), and for 4000 科 寺 和 4120 (righb 


Numerical Experiment， We consider the perturped Kepler problem 








由 二 7 太一 dd 091 
1 工 一 上 1， 各 
(0@ 十 史 )3/2  (@ 十 92)5/2 
(1.2) 
中 一 mp 力 一 d2 002 
痊 一 人 nn 
(十 史 )3/2  (@ 十 92)5/2 








(0 一 0.015) with initial values 


qi(0) =1 一 e，%(0)=0，Di(0) =0，pa(0)=VL+e)/ 人 一 e) 


(eccentricity e = 0.6). As a numerical method we take the 91O7rmaze/ 一 Ye7iet ycje111e 
(1.17) which is symmetric, Symplectic, and of order 2. The fxed step Size imple- 
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Fig. 1.2. Study of the error in the Hamiltonian and of the global error for the Stormer-Verlet 
Scheme. Fixed step size implementation with 妃 = 10-3, variable step size with 7D1 一 10 一 


mentation ls Straightforward. For the variable step Size Strategy we take for errn the 
Euclidean norm of the difference between the Stormer-Verlet Solution and the Sym- 
plectic Euler solution (which is available without any further function evaluation). 
Since ermm = O(12), wetaker = 一 2in(1.1). 

The numerical solution in the (qi, qz)-plane is presented in Fig.1.1. To make 
the long-time behaviour of the two implementations visible, we Show the numer- 
ical solution on three different parts of the integration interval. We have included 
the numbers of steps needed for the integration to reach 夺 三 120, 2120, and 4120， 
Tespectively. We see that the quajlitative behaviour of the variable step Size imple- 
mentation is not correct, although it is more precise on Short intervals. Moreoveb， 
the near-preservation of the Hamiltonian is lost (See Fig. 1.2) as ls the linear error 
growth. Apparently, the error in the Hamiltonian behaves like |a 一 只 | for the vari- 
able step size implementation, and that for the solution like |ct 一 qt?| (with constants 
ab, cd depending on 721). Due to the relatively large eccentricity of the problem， 
the variable step Size Implementation needs fewer function evaluations for a given 
accuracy on a Short time interval, but the opposite ls true for long-time integrations. 

The aim of the next two Sections js to present approaches which permit the 
use of variable step Sizes for Symmetric or Symplectic methods without losing the 
quajitatively correct long-titme behaviour. 
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VIU.2 Time Transformations 


Avariable step Size Implementation produces approximations Von a (non-equi- 
distanb grid { 妃 }. The same effect can be achieved by performing in advance a 
time transformation 上 人 T and by applying a constant Step Size Implementation 
to the transformed system. Ithe time transformation ls given as the Solution of a 


differential equation, it follows from the chain rule 2 三 人 that the transformed 
System is 
y =)J)， =c( 幼 . C.D 


Here, prime indicates a derivative with respect to T, and we use the Same jletter y for 
the solutions y( 切 of y = jy) and y(r) of (2.1U). 坟 c(y) > 0, the correspondence 
ti Tisbijective. 

Applying a numerical method with constant Step Size s to (2.1) yields approxi- 
mations yn 久 WP) 三 V(t 加 ), where 六 一 mEand 


(nm 十 1)e 
t 如 1 一 如 王 | aly(7)) dr ssa(yn). (2.2) 


E 


Approximations to 如 are obtained by integrating numerically the differential equa- 
tion 才 一 al(V) together with y = ac(V) 

JIn the context of geometric numerical integration, we are interested in time trans- 
formations such that the vector field c(V)jJy) retains geometric features of 帮 Z). 


VIU.2.1 Symplectic Integration 


For a Hamiltonian system 7 = jy) = .JIV 厅 (W itis natural to search for step 
Size functions o(y) such that (2.1) is again Hamiltonian. For this we have to check 
whether the Jacobian of c(y)YV 瓦 (四 ) isymmetric (cf. Integrability Lemma VI.2.7). 
But this is the case only 让 YV 克 (J)Vc(7 is symmetric ie V(y) and Vo(y) 
are collinear so that 区 人 (y( 鸭 ) = Vc(y() 7JVY(yO) 一 0. Consequently， 
aly) = Comst along solutions of the Hamiltonian system which is what makes this 
approach unattractive for a variable step Slze integration. This disappointing fact has 
been observed by Stoffer (1988, 1995) and Skeel & Gear (1992). 

The main idea for circumventing this difhiculty is the following: Suppose we 
want to integrate the Hamiltonian System with steps of Size 几 久 sal(y) ，where 
ol) > 0is a state-dependent given function and es > 0is a small parameter. 
Instead of multiplying the vector field fy) = .一 1V 末 (V) by oa(y), we consider the 
1eW 达 G111111o111a71 


天 切 = cg( 瑟 g)- 西 )， C.3) 


where 万 0 = (yo) for a fixed initial value V0. The corresponding Hamiltonian 
System 1S 
外 一 5( 轨 7Y 厂 轨 上 +( 瑟 轨 一 配 )7 Ya(Oy) C.4) 
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Compared to (2.1) we have introduced a perturbation，which vanishes along the 
solution of the Hamiltonian System passing through yo, but which makes the System 
也 amiltonian. 

Time transformations Such as in (2.3) are used in classical mechanics for an an- 
alytic treatment of Hamiltonian Systems (Levi-Civita (1906, 1920)，where (2.3) ls 
called the “Darboux-Sundman transformation ", see Sundman (1912)). Zare 儿 Sze- 
behely (1975) consider Such time transformations for numerical purposes (without 
taking care of Symplecticity). Waldvogel 久 Spirig (1993) apply the transformations 
proposed by Levi-Civita to Hill's lunar problem and solve the transformed equations 
by composition methods in order to preserve the Symplectic structure. The following 
general procedure was proposed independently by Hairer (1997) and Reich (1999). 


Algorithm 2.1. 4PP1y da1z QFPBitrary 8y1DPlectic o1e-step 11e1 太 od Wi 态 co11s1a121 SteP 
Size E 1o 1je azazltiomia1a syste11 (2. 逢 atU81ie1lfed py 六 一 (Tisg yields 11ae 六 
ical! GaPP1oxi1C1io13 (z 刀 ) wii 写 四 如) 


Although this algorithm yields numerical approximations on a non-equldistant 
grid, it can be considered as a fixed step Size, Symplectic method applied to a differ- 
ent Hamiltonian System. This interpretation allows one to apply the standard tech- 
niques for the study of its long-time behaviour. 

A disadvantage of this algorithm is that for separable Hamiltonians (2, q) 三 
7(D) 十 V(d) thetransformed Hamiltonian (2.3) is no longer separable. Hence, meth- 
ods that are explicit for separable Hamiltonians are not explicit in the implementa- 
tion of Algorithm 2.1. The following examples illustrate that this disadvantage can 
be partially overcome for the important case of Hamiltonian functions 


HE(pg = DTMTIp+U(g， C.) 


where M is a constant Symmetric matrixX. 


上 xample 2.2 (Symplectic Euler with p-Independent Step Size Function). For 
step size functions al(a) the symplectic Euler method, applied with constant step 
Slze s to (2.4), Teads 


1 
pn+l = Dn 一 coldn)VU(dn) 一 =( 5 避 NM -pnH1 十 U(dan) 一 而)vco(o 


dmn+l 二 dm 十 sa(dqn)AMTIpn+i 





and yields an approximation at t 如 +1 = 如 十 sa(qn). The first equation is non- 
linear (quadratic) in pn+1. Introducing the scalar quantity 8 := pn := 
7 1 -1Dn+li,itreduces to the scalar quadratic equation 

2 


2 


5 





2Dm 一 sal(dqn)VU(dn) 人 s( 二 V(dn) 二 夯 ) Val(dan) 








QI 


which can be solved directly. The numerical solution (Pi,qn+1) is then given 
explicitly. 
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Choices of Step Size Functions. Sometimes suitable functions o(p, 9q) are known 
a priori. For example, for the two-body problem one can take o(D,dq) = |dll2, e.g, 
Q 一 2,ora =3/2to preserve the scaling invariance (Budd & Piggott 2003), so 
that Smaller step Sizes are taken when the two bodies are close. 

An interesting choice，which does not require any a Priori knowledge of the 
solution, is ac(y) = | The solution of (2.1) then satisfies | (7 = 1 (arc- 
length parameterization) and we get approximations yn that are nearly equidistant 
in the phase Space. Such time transformations have been proposed by McLeod 
Sanz-9erna (1982) for graphical reasons and by Huang 儿 Leimkuhler (1997). For a 
Hamiltonian system with 妃 (p, 9q) given by (2.5), it is thus natural to consider 


一 1/2 
5 9) 二 (3prMTip+ VU(OTMTIVI(O) (0.6) 


We have chosen this particular norm, because it leaves the expression (2.0) invariant 
with respect to linear coordinate changes g ”4d (implying P 4- 7D). Ex- 
ploiting the fact that the Hamiltonian (2.3) is constant along Solutions, the Step Size 
function (2.0) can be replaced by the p-independent function 


cg =(( 西 -IO)+VU Orvog) 2.7) 


The use of (2.0) and (2.7) gives nearly lidentical results, but (2.7) ls easier to im- 
plement. I we are interested in an output that is approximatively equidistant in the 
d-Space, we can take 


1T/2 


clg)= ( 一 7(q)) (2.8) 


了 上 xample 2.3 (Stbrmer-Verlet Scheme withp-Independent Step Size Function). 
For a step Size function al(dq) the Stormer-Verlet scheme gives 


np+l/2 二 Don 一 5c(gn)VU(wn) 一 引 ( 已 (oa on] 二 夯 ) Va(dn) 


dn 十 5(c(dn) 十 a(qn+i)M Lpn+lya (2.9) 


dm 十 1 





DPn+1 三 DPn+l/2 一 5c(gnHD)VU(gn+l) 


避 (E(psram， dan+1) 一 夯 ) Va(an+1l) 


The first equation ls essentially the Same as that for the Symplectic Euler method， 
and it can be Solved for pn+l/2 as explained in Example 2.2. The second equation 
is implicit in qi, but it is sufficient to Solve the scalar equation 


已 


7Y=c( 和 +E(c(o) 二 Mipntus C.10) 


for 7 = ca(an+1). Newton iterations can be efficiently applied, because Va(da) is 
available already. The last equation (for pn+l) is expliclit. This variable step Size 
Stormer-Verlet Scheme gives approximations at 如 ,Where 


E 
如 上 1 二 如 十 于 (ac(gn) 朱 al(qn+1)). 
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Fig. 2.1. Various step Size Strategies for the Stormer-Verlet Scheme (Example 2.3) applied to 
the perturbed Kepler problem (1.2) on the interval [0, 10] (approximately two periods) 


In Fig.2.1 weillustrate how the different step Size functions infuence the poSi- 
tion ofthe output points. We apply the Stormer-VYerlet method of Example 2.3 to the 
perturbed Kepler problem (1.2) with initial values, perturbation parameter, and ec- 
centricity as in Sect. VJU.1. As step Size functions we use (2.7), (2.8), and constant 
Step Size ol(dq) 三 1. For all three choices of o(q) we have adjusted the parameter s in 
Such a way that the maximal error in the Hamiltonian is close to 0.01. The step Size 
Strategy (2.7) 1 apparently the most eft8cient one. For this Strategy,， we observe that 
the output points in the dg-pPlane concentrate in reglions where the velocity is large， 
while the constant step Size Implementation Shows the opposite behaviour. 


VJIIU.2.2 Reversible Integration 


For p-reversible differential equations y = Fo ie., Fovy) = 一 py) forall ythe 
time transformed problem (2.1) remains D-Teversible 让 


OpyV) = CO (2.11) 


This condition is not very restrictive and is Satisfied by many important time trans- 
formations. In particular (2.11) holds for the arc length parameterization al(y) 一 
| 于 这 pis orthogonal, Consequently, it makes sense to apply symmetric, re- 
versible numerical methods with constant Step Size s directly to the System (2.1). 

However Similar to the Symplectic integration of Sect. VIIUI.2.1, there ls a serious 
disadvantage. For Separable differential equations (il.e., problems that can be split as 
六 = 万 (q),d= 户 (D)) and for non-constant a(p,d) the transformed system (2.1) 
is no longer Separable. Hence, methods that are explicit for Separable problems are 
not neceSSarily explicit for (2.1). 


了 上 xample 2.4 (Adaptive Stormer-Verlet Method).， We consider a Hamiltonian 
System with Separable Hamiltonian (2.3), and we apply the Stormer-Verlet Scheme 
to (2.1). This yields (Huang 儿 Leimkuhler 1997) 


2mp 二 172 二 Dopr 一 5snVU(gn) 


可 < 
dn+1 三 9qn 十 末 (sn 十 snH)M Dailya (2.12) 





必 
mp+l 一 Jp 十 1/2 5snH1YVUV(dn+I)， 
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where sn = 0(Dn+l/2,dn) and sn+l = a(Dn+l/2,dn+1) (notice that the sn+1 of 
the current step is not the same as the sn of the subsequent step, 这 c(P, dg) depends 
on D). The values (pn+l, qdn+i) are approximations to the solution at 如 ,， where 


如 +1 三 如 十 (sn 十 sn+1). 


For a D-independent Step Size function s, method (2.12) corresponds to that of Ex- 
ample 2.3, where the terms involving Va(d) are removed. The implicitness of (2.12) 
is comparable to that of the method of Example 2.3. Completely explicit variants of 
this method will be discussed in the next Section. 


We conclude this section with a brief comparison of the variable step Size 
Stormer-VYerlet methods of Examples 2.3 and 2.4. Method (2.12) ls easier to im- 
plement and more effcient when the step size function al(P, q) is expensive to eval- 
uate. In a few numerical comparisons we observed, however that the error in the 
Hamiltonian and in the solution ls in general larger for method (2.12), and that 
the method (2.9) becomes competitive when al(p,dq) is Dp-independent and easy to 
evaluate. A Similar observation in favour of method (2.9) has been made by Calvo， 
LOpez-Marcos 儿 Sanz-Serna (1998). 


VIU.3 Structure-Preserving Step Size Control 


The disappointing long-time behaviour in Fig.1.1 of the variable step Size imple- 
mentation of the Stormer-Verlet Scheme is due to lack of reversibility. Indeed, for a 
D-Ieversible differential equation the step size An+l1/2 taken for stepping from yn to 
n+l Should be the Same as that when stepping from pyn li to py (cf. Fig.V.1.1). 
The strategy of Sect. VIU.1, for which the step Size depends on information of the 
preceding step, cannot guarantee Such a property. 


VIIU.3.1 Proportional, Reversible Controllers 


Following a Suggestion of Stoffer (1988) we consider step Sizes depending only 
on information of the present step, i.e., being Propo1tiozal to Some function of the 
actual state. This leads to the algorithm 


Zn 二 1Ll 一 jiya(yn)， PT112 二 E5(VynE)， (3.1) 


Where 2jp(V) is a one-step method for y = jy), and s is a small parameter. For 
theoretical investigations it is usefu] to consider the mapping 


三 (O) :二 esoy ce)(y). (3.2) 


This is a one-step discretization, consistent with y = s(V, 0) jy), and applied with 
constant Step Size s. Consequently, all results concerning the long-time integration 
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WwWith constant Steps (e.g., backward error analysis of Chap.IX), and the definitions 
of symmetry and reversibility can be extended in a straightforward way. 


Symmetry. We call the algorithm (3.1) symmetric, 让 殉 (y) is symmetric，i.e.， 
于 一.Inthe case ofasymmetric Gn this is equivalent to 


5( 办 一 5) 5(V， 5) with 乡 一 Se。y ea) ( 细 . (3.3) 


Reversibility。 The algorithm (3.1) is called p-reversible 寺 ,wpen applied to a p- 
Teversible differential equation, 匹 (y) is p-reversible, ie po 匹 = 歼 1op(cf. 
Deftinition V.1.2). 芝 the method @n is p-Teversible then this is equivalent to 


s(O 人 轧 e)=s(s) with 分 = Cosoa(. (3.4) 


Example 3.1. Aiming at step Sizes 几 多 so (cf. (2.2)), Hut Makino 多 McMillan 
(1995) propose the use of s(y,s) = 站 (ca( 人 十 a( 办 ) where, as in Sect. VIIL.2, o(y) 
is Some function that uses an a priori knowledge of the solution of the differential 
equation. Notice that, because of 人 = Bt)(y), the value of s(y,s) is deftined 
by an implicit relation. Condition (3.3) is satisfied whenever Bn(V) is Symmetric， 
and (3.4) is Satisfied whenever @j(V) is p-reversible and o(pV) = oa(V) holds. For a 
proof of these statements one shows that s( 从 一 =) and s(y,s) (resp. s(p 一 从 e) and 
s(V,s)) are Solution of the Same nonlinear equation. 


Hovw can we find suitable step size functions s(y,s) which satisfy all these prop- 
erties, and which do not require any a priori knowledge of the solutiony In a re- 
markable publication，9Stoffer (1995) gives the key to the answer of this question. 
He Simply proposes to choose the step Size 玉 in such a way that the local error esti- 
mate Satisfies e 六 三 TD1 (in contrastto er 广 科 701 for the standard Strategy). Let us 
explain this idea in Some more detail for Runge-Kutta methods. 


上 xample 3.2 (Symmetric，Variable Step Size Runge-Kutta Methods). For the 
numerical solution ofy = jy) we consider Runge-Kutta methods 


到 = 狗 二 六》 aif( 态 )，， gt 加 十 》 2)， G.5) 


JJ 一 1 1 一 1 


With coefficients Satisfying as+1-is+17 十 aij 三 oj forall?7.Such methods are 
Symmetric and reversible (cf. Theorem V.2.3). A_ common approach for step Size 
control is to consider an embedded method 从 +1 三 加 十 闫 >》 1 已 2) (which 
has the Same internal stages 斑 ) and to take the difference nl 一 态 +1ie.， 


D(w 问 = 产 > eif(2) G3.6) 
1 


with e; 一 pb; 一 0 as indicator of the local error. For methods where 到 久 V( 刀 十 ci 站) 
(e.g.，collocation or discontinuous collocation) one usually computes the coe 午 - 
cients ei from a nontrivial solution of the homogeneous linear System 
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>》 eicorT = 一 0 for 大王 1 5 一 ] (3.7) 


This yields Dj = O(j7) with r close to s. According to the Suggestion of 
Stoffer (1995) we determine the step Size An+1/2 Such that 


|P(Un priya)ll = 7Z21. (3.8) 


A Taylor expansion around 万 = 0 shows that D(y 四 = 小 ( 人 1 二 OUT with 
some r > 1. We assume ||dr(y)|| 夭 0 and we put s = TD so that An+1/2 from 
(3.8) can be expressed by a smooth function s(V, s) as (3.1). 

To satisfy the yaz111etry relation (3.3) we determine the ei Such that 


es+l_i 一 ei forall; OT es+1_i 三 一 ei foral; (3.9) 
(Hairer 多 Stoffer 1997). Ithe Runge-Kutta method is Symmetric, this then implies 


|PGw 关 =|PUnsD 一 | with got+l 二 到 (yo)， G.10) 


This follows from the fact that the internal Stage vectors Yi of the step from Vyn” to 
WwWn+l and the stage vectors Yi of the step from yn li to yn (negative step size 一 站 
are Telated byYi; 一 胞 Ti. The step size determined by (3.8) is thus the same for 
both steps and, consequently, condition (3.3) holds. 

The reversipilitby requirement (3.4) ls a consequence of 


|ZP(mw 站 = |PDP(oryyns 及 | with yn+1L 三 2p(on) (3.11) 


which is satisfied for orthogonal mappings p (ie., op = 门 . This is seen as follows: 
applying Bj to pl1oyn4i gives Oo-1oyn, and the internal stages are 了 ; 一 0 人 1. 
Hence, we have from (3.9) that D(p-lynti ji = 士 O-1D( 站 ,and (3.11) fol- 
lows from the orthogonajlity of 25. 

A Simple Special case ls the trapezolidal rule 





加 一 加 十 2 和 全 (jn) + G.12) 


combined with 
Do 站 = 下 (JonsD 一 Fn) 


The scalar nonlinear equation (3.8) for n+1/2 can be Solved in tandem with the 
nonlinear System (3.12). 


了 上 xample 3.3 (Symmetric，Variable Step Size Stormer-Verlet Scheme)， The 
strategy of Example 3.2 can be extended in a straightforward way to partitioned 
Runge-Kutta methods. For example, for the second order Symmetric Stormer-Verlet 
Scheme (I.1.17), applied to the problem 0 = D,D = VU(q ,we can take 


有 人 一 VU(dn) 】 


DUn,gnj) 一 了 (VU(wsD TVU(o)) 
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Fig. 3.1. Stormer-Verlet Scheme applied with the symmetric adaptive step Size Strategy of 
Example 3.3 (72/ 三 0.01); the three pictures have the Same meaning as in Fig.1.1 


as error indicator. The first component is just the difference of the Stormer-Verlet 
solution and the numerical approximation obtained by the Symplectic Euler method. 
The second component is a Symmetrized version of it. 

We apply this method with /n+1/2 determined by (3.8) and 7D! = 0.01 to the 
perturbed Kepler problem (1.2) with Initial values as in Fig. 1.1. The result is given 
in Fig.3.1. We ldentify a correct qualitative behaviour (compared to the wrong be- 
haviour for the standard step Size Strategy in Fig.1.1). It should be mentioned that 
the Work for solving the scalar equation (3.8) for An+1/2 让 not negligible, because 
the Stormer-Verlet Scheme is explicit. Solving this equation iteratively, every itera- 
tion requires one force evaluation VU(q). An efficient solver for this scalar nonlin- 
ear edquation Should be used. 


A Two-Step Proportional Controller With the aim of obtaining a completely ex- 
plicit integrator, Huang 儿 Leimkuhler (1997) propose the use of two-term Tecur- 
Tence relations for the Step Size Sequence, See also Holder Leimkuhler 儿 Reich 
(2001). Instead of using a relation between An+l1/2, yn and yn+1 (cf. Example 3.1) 
Which is necessarily implicit, it ls Suggested to use a Symmetric relation between 
hn-1/2，hn+l/2，and yn，which then is explicit, In particular，with the notation 
An+1/2 三 Esn+l/2,itis Proposed to use the two-term recurrence relation 


1 1 2 
站 三 (3.13) 
Smn+1/2 Sn 一 1/2 CCyn) 





Starting with s1/2 王 alyo). In combination with the Stormer-Verlet method for 
Separable Hamiltonians, this algorithm is completely explicit, and the authors report 
an excellent performance for realistic problems. 

Arigorous analysis ofthe long-time behaviour of this variable Step Size StormerI 一 
Verlet method is much more difficult, The results of Chapters IX and XI cannot be 
applied, because it is not a one-step mapping yn 上 yn+1. The analysis of Cirilli， 
Hairer & Leimkuhler (1999) shows that, Similar to weakly stable multistep methods 
(Chap.XV), the numerical solution and the step Size Sequence contain osScillatory 
terms. Although these oscillations are usually very Small (and hardly visible), 
Seemas difficult to get rigorous estimates for them. 
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VIU.3.2 Integrating, Reversible Controllers 


All variable step Size approaches of this chapter are based on Some time transfor- 
mation t 过 了 given by 华 一 al(V) so that the differential equation, expressed in the 
new time variable 7, becomes 


y = 二 JJ)， zc)=1 (3.14) 


In Sect.VIIL2 we insert z-1 = oly) into the differential equation and apply a nu- 
merical method to y = ca( 轨 j. IPSect.VII.3.1 we first discretize the algebraic 
relation za(V) = 1 expressing 2+1/2 in terms of yn and yn+l, and then apply a 
one-step method to the differential equation in (3.14) assuming > 三 2r+1/2 being 
Constant. 

In the present Section we first differentiate the algebraic relation of (3.14) with 
respect to 7. This yields by Leibniz' rule zc(y) 十 zVa(y)TwW = 0sothat 


攻 当 
ay) 


The idea of differentiating the constraint in (3.14) has been raised in Huang 文 
Leimkuhler (1997), but soon abandoned in favour of the controller (3.13). The sub- 
Sequent algorithm together with its theoretical justification ls elaborated in Hairer 
信 Soderlind (2004). The idea is to discretize first the differential equation in (3.15) 
and then to apply a one-step method to the problem (3.14) with constant 2z. The 
proposed algorithm is thus 


xs=GO) with GO =- 一 Vol(J) 7 (3.15) 


2+1/2 二 2-1/2 十 ECG(n) 


(3.10) 
Zn+lL 一 Zey/zhiva (yn) 


with 21/2 三 20 十 EG(Vo)/2 and z0 = 1/c(yo). This algorithm is explicit whenever 
the underlying one-step method 环 1,(V) is explicit. It is called imtegratizzg controller， 
because the step Size density is obtained by Summing up Small quantities. 

Foratheoretical analysis it is convenient to introduce 2 = (2+1/2 十 2 1/2)/2 
and to write (3.10) as a one-step method for the augmented System 


工 
y =)， =“=GO G.17) 
Notice that 7(y,2z) = zolyj is a first integral of this system. 


Algorithm 3.4. Zefl @1j(V) pe a ome-step 1ietpod jpory = /内 V(0) = V0. 丙 太 
Gy give py (3.7 和 20 = 1/c(yoj, anad comstait E, We jet 


2Zm 十 1/2 一 21 十 CG(Vn)/2 


ES G.18) 
2r+1/2 十 EG(Vn+1)/2. 


Zr 十 1 





7Tje yalues yn PP1oXiaiate yy 如) Were 如 +1 三 如 十 E/zp1/2. 
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This algorithm has an interesting interpretation as Strang Splitting for the Solu- 
tion of (3.17): it approximates the fow of > = G(V) with fxed y over a half-step 
<s/2; then applies the method Ze to y = jy)/z with fxed >; finally, it computes a 
Second half-step of zx = G(V) with fxed y/. 

With the notation 


环 9 


the Algorithm 3.4 has the following properties: 





e GE。 is Symmetric whenever 9hn ls Symmetric; 
e Ge is Teversible with respect to whenever Gjh is reversible with respect to DO and 
G(pV) = 一 G(V) (this is a consequence of ac(py) = a(Z). 


These properties Imply that standard techniques for constant Step Size Implementa- 
tions can be applied to 5 and thus yield insight into the variable step Size algo- 
rithm of this section. It will be shown in Chap.XI that when applied to integrable 
Teversible Systems there ls no drift in the action variables and the global error grows 
only linearly with time. Moreover the first integral 7(y, z) = za(Vy) of the system 
(3.17) ls also well preserved (without drift) for Such problems. 


了 上 xample 3.S (Variable Step Size Stbrmer-Verlet method). Consider a Hamil- 
tonian System with separable Hamiltonian 达 (2,9) = T(P) 二 V7(q). Using the 
Stormer-VYerlet method as basic method the above algorithm becomes (starting with 
20 三 1/c(yo) andzijz 三 2 加 0 十 EG(Do,dqo)/2) 








2r+l/2 二 2 加-1/2 十 EG(Dn dan) 
Dn+l1/2 二 Dr 一 EVU(dn)/C2zn+l/2) G.20) 
qdn+Hl 三 和 十 EVT(Dpn+l/2)/z+172 
pn+1l 三 pn+l/2 一 eVU(qnHl)/(22n+1/2). 
This method is explicit，Symmetric and reversible as long as GDp 三 一 G,， and 
computes approximations on a non-equidistant grid {tn} given by 如 +1 三 如 十 


E/ 2zm1/2， 

Let us apply this method to the perturbed Kepler problem with data and initial 
values as in the beginning of this chapter Further we select o(g) = (q7g)%/2 with 
Q 一 3/2, sothat the control function (3.15) becomes 


G(D,9) = -ap7d/d7a (3.21) 


Figure 3.2 Shows the error in the Hamiltonian along the numerical Solution as well 
as the global error in the solution (fictive step Size <s 三 0.02). The error in the 
Hamiltonian is proportional to sz without drifb and the global error grows linearly 
with time (in double logarithmic scale a linear growth corresponds to a line with 
Slope one; Such lines are drawn in grey). This is qualitatively the Same behaviour as 
observed in constant Step Size Implementations of symplectic methods. 
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Fig. 3.2. Numerical Hamiltonian and global error as a function of time 
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Fig. 3.3. Step Sizes of the variable step Size Stormer-Verlet method as a function of time, and 

the control error zna(qn) 一 2z00(qo) (grey curve) 








Figure 3.3 Shows the selected Step Sizes An+1/2 王 E/2+1 /2 as a function of 
time, and the control error za(dqn) 一 z00(qo) in grey. Since its deviation from the 
constant value zoo(dao) = 1 is small without any drift, the step density remains 
close to 1/c(d). For an explanation of this excellent long-time behaviour we refer 
to Sect. XI.3. 


VIU.4 Multiple Time Stepping 


A completely different approach to variable step Sizes will be described in this sec- 
tion. We are interested in Situations where: 


e many Solution components of the differential equation vary Slowly and only a few 
components have fast dynamics; OF 

e _ computationally expensive parts of the right-hand Side do not contribute much to 
the dynamics of the solution. 


In the first case it ls tempting to use large step Sizes for the Slow components and 
Small Step Sizes for the fast ones. Such integrators, called 71IUL1i7ate 11etpods， were 
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first formulated by Rice (1960) and Gear 儿 Wells (1984). They were further devel- 
oped by Gunther 贸 Rentrop (1993) in view of applications in electric circult simula- 
tion, and by Engstler & Lubich (1997) with applications in astrophysics. Symmetric 
multirate methods are obtained from the approaches described below and are Spe- 
cially constructed by Leimkuhler 儿 Reich (2001). 

The second case Suggests the use of methods that evaluate the expensive part of 
the vector field less often than the rest. This approach is called 12zULLPLe De steDp- 
Pi108. It was originally proposed for astronomy by Hayli (1967) and has become 
very popular in molecular dynamics Simulations (Streett, Tildesley 儿 9aville 1978， 
Grubmitller, Heller Windemuth 儿 Schulten 1991, Tuckerman,，Berne 久 Martyna 
1992). As noticed by Biesiadecki 儿 Skeel (1993), one approach to Such methods is 
within the framework of splitting and composition methods, which yields Symmet- 
Tic and Symplectic methods. A second family of symmetric multiple time stepping 
methods results from the concept of using averaged force evaluations. 


VJIIU.4.1 Fast-Slow Splitting: the Impulse Method 


Consider a differential equation 


六 = JJ， JW)=JEowl)+AE(O)， (4.1) 


where the vector field is split into Summands contributing to Slow and fast dynam- 
ics, respectively, and where jglow](y) is more expensive to evaluate than FEastl (). 
Multirate methods can often be cast into this framework by collecting in glow] ( 引 ) 
those components of jy) which produce slow dynamics and in 上 Hastl(y) the re- 
maining components. 


Algorithm 4.1. ForagivenN >1andjortipe dijrerenfial eguation (4.71) aiD1e 
17111e SteDD1118 11e1jpod 18 opiai11ed ro 


[slow]\* [fastl\ N [slow] 
(ny/2 ) 。 (EN ) 592hj/2 ， (4 
WApeye Gov] CQG11G Ge G1e 71U111eFiCcaL iteg1atory co118iste1t wii yy 三 上 Elow] ( 切 


a1d 六 = East(y，respectivey 


The method of Algorithm 4.1 is already stated in symmetrized form (27 denotes 
the adjoint of @j). Itis often called the imzpulse method, because the slow part glow] 
of the vector field is used - imnpulse-like - only at the beginning and at the end of 
the step, whereas the many Small Substeps in between are concerned Solely through 
integrating the fast system 7 一 East (四 )， 


Lemma 4.2， Zet Gow] De ad1 Q1Di1ra1y Jetpod oforder 1 ad Ge QG.SyY11I111E117C 


11e1jod of order 2. 7T1e1， 1 加 e UPLe 1i11e StepPDi118 GQ1gori1p1a (4.2) 18 Sy1I11Iet1ric 
Ga11d of order 2. 

矿 jBlowl(y) anad Eastd(y) are Famailonian azad 洲 Go CQ10 Ge CI1e Do 轴 
Sy111DUectic, 1je1z 1pe 11IULDLe 1111e stepD1118 11etpod 1S also Sy1IDiectic. 
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Proojf Due to the interpretation of multiple time stepping as composition methods 
the proof of these statements ls obvious. 














The order statement of Lemma 4.2 is valid for 由 一 0, but should be taken with 
caution if the product of the step Size 几 with a Lipschitz constant of the problem 
is not Small (See Chap.XIII for a detailed analysis): it is 1ot stated, and is not true 
in general for large N, that 让 刀 and 六 /N are the step Sizes needed to integrate the 
Slow and fast System, fespectively, with an error bounded by s, then the error of the 
combined Scheme is O(E)， 


The most important application of multiple time Stepping is in Hamiltonian SyS- 
tems with a Separable Hamiltonian 


末 (Dq])=TOJ)+I(OD)， TO=Tae)TUE(D) (4.3) 


If we let the fast vector field correspond to T(P) + UEastj(d) and the slow vector 
field to ITBlow] (q),and 证 we apply the Stormer-Verlet method and exact integration， 
Tespectively, Algorithm 4.1 reads 


[slow] [fast] 代 [fast] \ [slow] 
jp/2 9 (ppyj2N 9 %2j/N 9 Opj2N) 9 %2Pp/2 ， (4.4) 
where 7 ， ofow| ， os are the exact flows corresponding to the Hamiltonian sys- 


tems forT(D), DBowl(d) ,Dast(d),respectively. Notice that for V = 1 the method 
(4.4) reduces to the Stormer-Verlet Scheme applied to the Hamiltonian System with 
瓦 (p, 四 .This is a consequence of the fact that olestl opPowl = po is the exact 
fow of the Hamiltonian System corresponding to UV(d) of (4.3). In the molecular 
dynamics literature, the method (4.4) is known as the Verlet-I method (Grubmiiller 
et al. 1991， who consider the method with little enthusliasm) or T-RESPA method 


(Tuckerman et al. 1992, with much more enthusiasmy). 


了 xample 4.3. In order to illustrate the effect of multiple time stepping we choose a 
“Solar System ”with two planets, ji.e., with a Hamiltonian 


五 (2,9) = 





工 他 十 2 pl 十 77207721 77207722 77217722 


2、770 IT21 7712 lo 一 al leo 一 ol le 一 cl 


where 7n0 = 1 nl = na = 10-2 and initial values qo = (0,0), io = (0,0)， 

4 三 (10),0 = (0,1), aa =(40), 0 =(0,0.5). With these data, the motion of 

the two planets ls nearly circular with periods close to 27 and 147, respectively. 
We split the potential as 





DIE 一 一 TO， Tong 一 一 人 
le 一 和 al lm 一 le 一 加 | 
and we apply the algorithm of (4.4) with N = 1 (Stormer-Verlebj, N 三 4 and 
AN = 8.Since the evaluation of oaov] is about twice as expensive as [eeH and 


that of py is of negligible cosb the computational work of applying (4.4) on a fxed 
interval is proportional to 
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Fig. 4.1. Maximal error in the Hamiltonian as a function of computational work 





27r (2 十 上 ) 

4 (4.3) 
Our computations have Shown that this measure of work corresponds very well to 
the actual cpu time. 

We have Solved this problem with many different step Slzes 几 . Figure 4.1 Shows 
the maximal error in the Hamiltonian (over the interval [0,200rj) as a function of 
the computational work (4.3), We notice that the value N = 4 yields excellent 
results for relatively large as well as small Step Sizes. It noticeapbly Improves the 
performance of the Stormer-Verlet method. 玉 NM becomes too large, an irregular 
behaviour for large Step Sizes ls observed. Such “artificial resonances” are notorious 
forthis method and have been discussed by Biesiadecki & Skeel (1993) for a Similar 
experiment; also See Chap. XII. For large N we also note a loss of accuracy for 
Small step Sizes. The optimal choice of N (which here ls close to 4) depends on the 
problem and on the Splitting into fast and slow parts, and has to be determined by 
experiment. 


The multiple time stepping technique can be iteratively extended to Problems 
with more than two different time Scales. The idea ls to Split the "fast ”vector field 
of (4.1) into Fasg(y) = jJ(W) + Us(y), and to replace the method Gas 记 
Alsgorithm 4.1 with a multiple time stepping method. Depending on the problem, a 
Signifcant gain in computer time may be achieved in this way. 

Many more multiple time stepping methods that extend the above Verlet-IT- 
RESPA/impulse method,，have been proposed in the literature,， most notably the 
mollified impulse method of Garcia-Archilla，Sanz-Serna 人 Skeel (1999); see 
Sect. XIII.1. 


VJIU.4.2 Averaged Forces 


Adifferent approach to multiple ttme stepping arises from the ldea of advancing the 
step with averaged Jorce eyaluatio11s. We describe Such a method for the Second- 
order equation 
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请 = jg = 疡 风声 (4.0) 


The exact solution Satisfies 
1 
yt+ 月 一 2 的 + 区 -有 = 性 太 和 - 几 7t+ 的 )@， 


where the integral on the right-hand side represents a welighted average of the force 
along the solution, which is now going to be approximated. Att = 如 ,we replace 


(0(t 十 07)) 久 jlov] (2 诗 East] (ulbhJ) 


where V(7T) is a solution of the differential equation 


必 一 JEow(n) 二 FE (u) (4.7) 


We then have 
岂 太 G- 名 (GAN))0 = 一 2u(0)+u 人 月 ， 


The velocities are treated Similarly, starting from the identity 


gt 二 月 一 和 一 内 = 站 FU 后 )ag， 


< 
ASymmetric Two-Step Method. For the differential equation (4.7) we asSsume the 
initial values 


uC0) 二 mm， VC0) 一 on . (4.8) 


This initial value problem is Solved numerically, e.g., by the Stormer-Verlet method 
with a smaller step size 士 /N on the interval [一 岂 则 , yielding numerical approxi- 
mations wN ( 士 尹 ) and vN( 士 几 ) to v( 士 尹 ) and ( 士 站, respectively. Note that no fur- 
ther evaluations of glow] are needed for the computation of ww ( 士 刀 ) and wN( 士 站. 
This fnally gives the Symmetric two-step method (Hochbruck 儿 Lubich 1999a) 























ni 一 2 十 yn 1 三 WN() 一 2wN(0) 十 N( 一 万) (4.9) 
Vn+1l 本 Vn -1 和 UN (及 ) VN( 一 尹 ) 


The starting values Vi and Wi are chosen as UN( 几 ) and vwN( 凡 ) which correspond to 
(4.7) and (4.8) for 史 = 0. 


A Symmetric One-step Method.，An explicit one-step method with similar aver- 
aged forces ls obtained when the initial values for (4.7) are chosen as 


u(0) 一 砍 ， Ya0) =0. (4.10) 


It may appear crude to take zero initial values for the velocity, but we remark that 
for linear jast] the averaged force (ww( 站 一 2u(0) 十 w( 一 内)/P2 does not depend on 
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the choice of (0). Moreover the solution then satisfies ( 一 妨 王 必 ( 轨 , so that the 
computational cost is halved. We again denote by wwN( 几 ) = VN( 一 /) the numerical 
approximation to w(P) obtained with step Size 士 几 /N from a one-step method (e.g,， 
from the Stormer-Verlet Scheme). Because of (4.10) the averaged forces 


J 三 (uv( 站 一 2uv(0) +uwN(- 门 ) = 声 (uv(P) 一 wN(O) 


几 


now depend only on yand not on the velocity yn. In trustworthy Verlet manner， 
the Scheme yn 1 一 27n 十 Vyn 1 三 jh2 FF can be written as the one-step method 








几 
Vn+1/2 二 mn 十 可 了 mr 
Zn+l 二 Un 下 RUn+112 (4.11) 
必 
Zn+l 二 Un+l/2 十 本 Zn+l1 ， 





The auxiliary variables vn can be interpreted as averaged velocities: we have 


2Vm++l 一 yn 一 1 2 人 (t+1) 3 2 人 (如 -1) 1 人 
on 二 一 g( 刀 十 gp)a0. 
2 2 2/ ， 





This average may differ substantially from V( 刀 ) 让 the solution is highly oscillatory 
in | 一 尺 , 几 . In the experiments of this section it turned out that the choice wo = 加 0 
and 三 Vn as velocity approximations gives excellent results. 

In a multirate context，Symmetric one-step Schemes using averaged forces 
were studied by Hochbruck & Lubich (1999b), Nettesheim 人 儿 Reich (1999), and 
Leimkuhler & Reich (2001). A closely related approach for problems with multiple 
time Scales ls the heterogeneous multiscale method by 也 (2003) and Engquist 
Tsali (2005). 


Example 4.4， We add a satellite of mass na 一 10-4 to the three body-problem of 
Example 4.3. It moves raplidly around the planet number one. The initial positions 
and velocities are 9q3 = (1.01, 0) and ps = (0,0). We split the potential as 


TTr[East] 9) 二 一 77717773 ， Urlow] dg) 二 一 77257727 
人 | | 
(7 天 (1,3) 


and we apply the methods (4.9), (4.11), and the impulse method (4.4). Since the 
sum in Uslow] contains 5 terms, the computational work is proportional to 








十 人 
2 人 for methods (4.11) and (4.4) 
6 十 2N 

6 for method (4.9). 


For each of the methods we have optimized the number N of Small stepSs. We ob- 
tained a fat minimum near N = 40 for (4.9) and (4.4), and a more pronounced 
minimum at N 三 12 for (4.11). Figure 4.2 Shows the errors att 一 10 in the posi- 
tions and in the Hamiltonian as a function of the computational work. 
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Fig. 4.2. Errors in position and in the Hamiltonian as a function of the computational work:; 
the classical Stormer-Verlet method, the impulse method (4.4), and the averaged force meth- 
ods (4.11) and (4.9). The errors in the Hamiltonian are indicated by grey lines (Same linestyle) 





The error in the position ls largest for the Stormer-Verlet method and Signif- 
icantly Smallest for the one-step averaged-force method (4.11). The errors in the 
velocities are about a factor 100 larger for all methods. They are not included in 
the fgure. The error in the Hamiltonian ls very Similar for all methods with the 
exception of the two-step averaged-force method (4.9), for which it is much larger. 


VIU.S Reducing Rounding Errors 


...the ideais to capture the rounding errors and feed them back into the 
Summation . (N.J. Higham 1993) 


All numerical methods for solving ordinary differential edquations require the com- 
putation of a recursion of the form 


2 人 后 俩 中 0mn， (3.1) 


Where jn, the increment, is usually smaller in magnitude than the approximation yn 
to the solution. In this situation the rounding errors caused by the computation of 0 
are in general smaller than those due to the addition in (9.1). 

A first attempt at reducing the accumulation of rounding errors (in fxed-point 
arithmetic for his Runge-Kutta code) was due to Gill (1951). 天 ahan (19065) and 
Moller (1965) both extended this idea to foating point arithmetic. The resulting al- 
gorithm is nowadays called “compensated Summation , and a particularly nice Pre- 
Sentation and analysis ls given by N. Higham (1993). In the following algorithm we 
asSsume that yn ls a Scalar; Vector valued recursions are treated componentwise. 
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Algorithm 5S.1 (Compensated Summation). Zel yo amd {0n}n>o pe giver amd 
PU1 e 三 0. Co111DpUte WUV2 .Jo11a (9.1) asjJollowsy: 
Jor 允 三 0,12,. .ao 
Q 一 mn 
e 一 e 十 0 
Zn+L1 王 Q 十 e 
e=e 十 (一 名 站 
e11d do 





This algorithm can best be understood with the help of Fig.3.1 (following the 
presentation of N. Higham (1993)). We present the mantissas of Hoating point num- 
bers by boxes, for which the horizontal position indicates the exponent (for a large 
exponent the box is more to the lefb. The mantissas of yand e together represent 
the accurate value of yn (notice that in the beginning e = 0). The operations of 
Algorithm 5.1 yield yl and a new e, which together represent yn+1l 三 Vn 十 0n. 
No digit of ov is lost in this way. With a standard summation the last digits of ov 
(those indicated by 0” in Fig.S.1) would have been missed. 





















































4 三 yn al4 ol 
e e! 0 
0n 01! 07 
ee 一 e 十 0n 61 e' 十 0 
yn+1 一 4 十 e a' a “十 6! 
e 一 e+(a 一 on+1) e' 十 0 0 














Fig. $.1. Ilustration of the technique of “compensated summation” 


Numerical 玉 xperiment，We study the effect of compensated summation on the 
Kepler problem (1.2.2) (written as a first order System) with eccentricity e = 0.6 
and initial values as in (L.2.11),， so that the period of the elliptic orbit is exactly 
27. As the numerical integrator we take the composition method (V.3.13) of order 
8 with the Stormer-Verlet Scheme as basic integrator. We compute the numerical 
Solution with step size 到 = 2r/500 once with standard update of the increment， 
once with compensated Summation (both in double precision) and, in order to get a 
reference Solution, we also perform the whole compnutation in quadruple precision. 
The difference between the double and quadruple precision computations gives Us 
the rounding errors. Their Euclidean norms as a function of time are displayed in 
Fig.3.2. 

We see that throughout the whole integration interval the rounding errors of the 
standard implementation are nearly a factor of 100 larger than those of the imple- 
mentation with compensated Summation. This corresponds to the inverse of the step 
Size or more precisely, to the mean quotient between yn and On in (3.1). In Fig. 5.2 
wehave also included the pure global error of the method (without rounding errorSs) 
at integral multiples of the period 27 (hence no oscillations are Visible). This ls 


324 VIU._ Structure-Preserving Inplementation 









-standard computation _- 





-一 pure global error 一 










compensated summation -一 一 一 


攻 二 人 一 让 六 

101 10? 103 104 
Fig. S$.2. Rounding errors and pure global error as a function of time; the parallel grey lines 
indicate a growth of O(t3/2) 











obtained as the difference of the numerical solution computed with quadruple pre- 
cision and the exact Solution. We observe a linear growth of the pure global error 
(this will be explained in Sect.X.3) and a growth like O(t3/ 2) due to the rounding 
errors. Thus, eventually the rounding errors will Surpass the truncation errors,， but 
this happens for the compensated Summation only after Some 1000 periods. 


Probabilistic 上 Explanation of the Error Growth. Our aim is to explain the growth 
rate of rounding errors observed in Fig.3.2. Denote by sk the vector of rounding 
errors produced during the computations in the kth step. Since the derivative of the 
fow wx(y) describes the propagation of these errors, the accumulated rounding error 
attimet 一 LN (大 一 Ki1S 


N 
7 一 >》， O 人 (JER (5.2) 
1 


For the Kepler problem and, in fact, for all completely integrable differential equa- 
tions (cf. Sect.X.l) the fow and its derivative grow at most linearly with time, 1.e.， 


| 和 (Js ar+adt 一 妨 ) for 二 > 刀 ， (5.3) 


Using sf = O(eps), where eps denotes the roundoff unit of the computer an appli- 
cation of the triangle inequality to ($.2) yields 7 = O(t2eps). From our experiment 
of Fig.3.2 we see that Such an estimate is too pessimistic. 

For a better understanding of accumulated rounding errors over long time inter- 
vals we make use of probability theory. Such an approach has been developed in 
the classical book of Henrici (1962). We assume that the components ski of sk are 
711do1z VGFiGD1es with mean and variance 


一 (si) = 0， War(sm) = Cui eps”， 


and uniformly bounded Cu 和 C.For simplicity we assume that all sxi are indepen- 
dent random variables. Replacing the matrix pf 忆 () in (3.2) with pf， 忆 (Y(t)) 
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and denoting its entries by wijF， the ;th component of the accumulated rounding 


eITrOr ($.2) becomes 
AN 
714i 一 > > QZO27KEKT) 
K=1 7 一 1 


a linear combination of the random variables skj. Elementary probability theory 
thus implies that 


NV 7 
五 (ii) 一 0 and Yor (Ti 三 >》， 》， 02 TD (sb ). 
KE=17=1 
JInserting the estimate (5.3) for wi7jK we get 


N 
C 
Jr (16 ) 芯 > (e+ 上 一 雪 ) ,ax Var(sij) 三 O( 亏 记 ep? ) 


全 


z 
中 | 
片 


Consequently, the Euclidean norm of the expected rounding error 71x 1S 


全 oo) = (ws eps 


This is in excellent agreement with the results displayed in Fig. 3.2. 


VIU.6 Implementation of Implicit Methods 


Symplectic methods for general Hamiltonian equations are implicit, and So are Sym- 
Imetric methods for general reversible Systems. Also, when we consider variable Step 
SiZe extenslons as described in Sections VJIHU.3 and VIU.2, we are led to nonlinear 
equations. The efficient numerical solution of such nonlinear equations ls the main 
difficulty in an Implementation of implicit methods. Notice that in the context of 
geometric integration there ls no need of ad-hoc Strategies for step Size and order 
Selection, So that the remaining parts of a computer code are more or less Straight- 
forward. 

In the following we discuss the numerical solution of the nonlinear System de- 
fned by an implicit Runge-Kutta method. We have the Gauss methods of Sect. I.1.3 
in mind which are Symplectic and Symmetric. An extension of the ldeas to parti- 
tioned Runge-Kutta methods and to Nystrom methods ls obvious. For Simplicity of 
notation we consider autonomous differential equations 7 = Fo) ,and we write 
the nonlinear System of Deftinition II.1.1 in the form 


Zi 一 岂 》 ai 加 十 Op =0， 1 5 (6.1) 
IJ 一 1 
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The unknown variables are Zn,...,Zsn, andthe equivalence of the two formula- 
tions ls via the relation ii; 一 了 ( 十 Zn). The numerical solution after one Step 
Can be expressed as 


儿 1 名 十 几 》 5 十 2 (6.2) 


2 并 


For implicit Runge-Kutta methods the equations (6.1) represent a nonlinear System 
that has to be Solved iteratively. We discuss the choice of good starting approxima- 
tions for Zir as well as different nonlinear equation Solvers (fxed-point iteration， 
modified Newton methods). 


VJIIU.6.1 Starting Approximations 


The most simple approximations to the solution Qin of (6.1D) are 2 = 0 or 
20 = jhcij(n) where ci = 7-1 aij. They are，however, not Very accurate 
and we will try to exploit the information of previous steps for Improving them. 
There are essentially two possibilities: either use only the information of the last 
step yn_1 hm yn (methods (A) and (B) below), or conslider a fxed 7 and use the 
interpolation polynomial that passes through iv for ! = 1,2,..，(method (C)). 
Let us Separately discuss these two approaches. 


(A) Use of Continuous Output. Consider the polynomial wn_1(b of degree s that 
interpolates the values (如 -1 yn 1) and (如 -1 十 cij mi)forz 三 1 ...,s,where 
in-1=yn I++T2in listheargumentin (0.1) ofthe previous step. For collocation 
methods (such as Gauss methods) wn_i(t) is the collocation polynomial, and we 
know from Lemma II.1.0 that on compact intervals 





un 人 提 一 yy 提 二 OUpatD (6.3) 


with dg = s, where yy 人 (加 denotes the solution ofy = jy) satisfying y( 刀 -1) = yn 一 1. 
For Runge-Kutta methods that are not collocation methods, (6.3) holds with qd de- 
fined by the condition C(a) of (TI.1.11). Since the solution of y = JW) passing 
through y( 妃 ) = 加 is O(Rh2+l) close to yy with p > q, we have wn 人 (人 一 
WwWn_i( 臣 十 O(P+l) and the computable value 


有 = 天 -加 了 =wn-li( 刀 十 ci 站 (6.4) 


Serves as starting approximation for (6.1) with an error of size O(jha+T). This ap- 
proach ls standard in variable Step Size implementations of implicit Runge-Kutta 
methods (cf. Sect.IV.8 of Hairer 久 Wanner (1996)). Since wn 1 一 iisaln- 
ear Combination of the im -1 三 了 nl1 一 yl,itfolows from (6.1) that it is also 
alinear combination of 屎 六 25 1), so that 


更 = 加 1 十 太 》 0 (6.5) 
7J=1 
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For a constant step Size implementation, the /jdepend only on the method coef- 
ficients and can be computed in advance as the solution of the linear Vandermonde 
type System 


1 十 ci)x 
pick-1= ii (6.6) 


(See Exercise 2). For collocation methods and for methods with g > 5 一 1the 
coefficients Cj from (6.6) are optimal in the sense that they are the only ones making 
(6.5) an sth order approximation to the solution of (0.1). For g < s 一 1 more 
complicated order conditions have to be conslidered (Sand 1992). 


(B) Starting Algorithms Using Additional Function 了 Evaluations. In particular 
for high order methods where s is relatively large, a much more accurate Starting 
approximation can be constructed with the aid of a few additional function eval- 
uations. Such starting algorithms have been investigated by Laburta (1997)，who 
presents coefficients for the Gauss methods up to order 8 in Laburta (1998). 

The idea is to use starting approximations of the form 


了 三 yn-1 十 刀 》， Di 了 玉 m1) 十 刀 》， 了 玖 区 放生 坦 ， (0.7) 
了 =1 了 一 1 
where myYsn laretheinternal stages ofthe basic implicit Runge-Kutta 


method (with coefficients cj, aii) o7), and the additional internal stages are computed 


from 
S 十 1 一 工 


Hin-1 三 rn-1 十 用 >》 Hi7 7Y7n 一 1 


和 1 


For a fxed i we interpret Y9 as the result ofthe explicit Runge-Kutta method with 
coefficients of the right tableau of 








exact ?th Stage approximate 
C 4 cC| 4 
| 开 星 放下 


Here, (Ma Ma) = M = (0 Ai 三 十 HUjjs and ci Di Pi are the vectors 
composed of cj /7 CD 放 Wi 思 Tespectively. The exact stage values Yin are interpreted 
as the result of the Runge-Kutta method with coefficients given in the left tableau 
of (6.8). The entries of the vectors 也 D and ai are 1, oj and aij, Tespectively, and 万 
is the matrix whose rows are all equal to 0 . 

芝 the order conditions (See 9ect.IU.1) forthe twoRunge-Kutta methods of (0.8) 
give the Same result for all trees with 和 7 vertices, we get an approximation of order 
7 ie Y0 一 2 一 O(2r+1). For the bushy tree 礁 一 [..，。] with vertices 
We have 
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S TI S S 
>》 05c 和 1 十 》 zi1T1 >》 jeg 十》 ai 十 oj 和 1 (0.9) 
J1 JI J=1 j1 


Notice that for collocation methods (Such as the Gauss methods) the condition C(s) 
reduces the right-hand expression of this equation to (1 二 cj)j8/K for 大 < 5. For 
7 一 0, these conditions are thus equivalent to (0.0). 

For the tree [六 ] = |[ ,| with 大 十 Tvertices we get the condition 


加 7 S 7 
>》， Bijajtcr 十 2 Zi7 ( Hzzcf 十 >》， 1 
1 1 一 1 1 一 1 


7I=1 
加 如 
一 》， Dajrce 开 十 2 Qi (pc 和 十 oj(L 十 中 


JI=1 TI=1 


(0.10) 


We now assume that the Runge-Kutta method corresponding to the right tableau of 
(6.8) satisfies condition C(s). This means that the method (c, 4,D) is a collocation 
method, and that the coefficients /ij have to be computed from the linear System 


S 十 ?一 1 开 
"Joncfr1= 大 二 1 .5. (6.11D) 
了 二 于 


The method corresponding to the left tableau of (6.8) then also satisfies C(s). Con- 
Sequently, the order conditions are Simplified considerably,，and it follows from 
Sect.II.1 that Y9 is an approximation to the exact stage value Yin of order s 十 1 or 
s 十 2ifthetfolowing conditions hold: 


order 5 十 工 让 (6.9) for 大 三 |... ,5 十 1; 


(6.12) 
order 5 十 2 让 (6.9) for 有 三 1... ,5 十 2,and(6.10) for 有 三 s 十 工 . 








For an approximation of ozder s 十 上 we put mm 三 1， we arbitrarily choose 
1，wWe compnute /11 from (6.11), and the coefficients Cj and il from (6.9) with 
开 一 1... ,5 十 1.Areasonable choice for the free parameter is /1 E [1,2] (in our 
computations we take 1 一 1.75fors 一 2,4andh 二 1.8fors=6. 

For an approximation of orders 十 2 weput 了 7 三 3.One ofthethree additional 
function evaluations can be Saved 让 we put HL = 0 and /2 三 1. This implies 
3 1 三 Viand 对 nl 三 加 ,Sothat the evaluation of /ZL 1i) is 
already available from computations for the preceding Step (FSAL technique，first 
Same as lastf ). In our experiments we take /13 三 1.6 for s 三 2, 13 三 1.65 for 
S 二 4,and1Ha 三 1.75fors 三 6.The coefficients 1i 罗 成 Zi7 are then obtained as 
the solution of Vandermonde like linear Systems. 

For an implementation it is more convenient to work with the quantities 20 一 
Y0 一 加 andto write (6.7) in the form 


亿 





1 Laburta (1997) proposes to consider mm 一 2, 1 = 0, Ha = 1 (apart from the first step 
this also needs only one additional function evaluation per step),，and to optimize free 
parameters by satisfying the order conditions for Some trees with one order higher. 
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2 = 六》 ai DT 十 >》 vi (6.13) 


JJ 二 1 JJ 三 1 
WwWith Qi 一 [05 0 


(C) Equistage Approximation. From the implicit function theorem, applied to the 
nonlinear System (6.1), we know that Qir 三 2(yn, 几 )，where the function z(2y 六 
is as smooth as j 帮 (四 . Furthermore, since on compact intervals the global error of a 
one-step method permits an asymptotic expansion in powers of 几 ，we have yn 三 
WN(t biJ 二 ONT+1I) withywv 全 站 三 2V 提 十 Per 办 十 .二 he 人 (thevalue 
of N can be chosen arbitrarily large 让 F(y) is suffciently smooth). Consequently， 
Zn iis O(AN+1) close to the smooth function 2(VN(t， 九 )， 刀 ) 引 寺 三 刀 二 0 
Let G( 坟 be the polynomial of degree 大 一 1 defined by G( 刀 -0) = 2 r for 
/一 1,...,K.Then,the value 

2 = G( 如 ) (6.14) 


yields a O(P+1) approximation to the solution of (6.1). This interpolation pro- 
cedure was first proposed by In't Hout (1992) for the numerical Solution of delay 
differential equations. For the iterative solution of the nonlinear Runge-Kutta equa- 
tions (6.1), the starting approximation (0.14) 1i8 proposed and analyzed by Calvo 
(2002). 

The Implementation of this approach is very Simple. Using Newton s interpola- 
tion formula we have 


2 二 态 h1 二 VZ1 二 十 Vi (6.15) 


With backward differences given by VZLin 三 Lin 一 Lin 1， V22in 三 -V 2 交 一 
V2Zin 1l， etc . 


Numerical Studqy of Starting Approximations. We consider the Kepler problem 
with eccentricity e = 0.6 and initial values Such that the period is 27. With many 
different step sizes 几 = 27r/N we compute NV 十 1 steps with the Gauss method 
of orderp = 2s (ID = 48,12). In the last step we compute the different starting 
approximations and their error (2 2 一 20.2)V2 as a function of the step 
SiZze 几 . The result is plotted in Fig. 6.1. There, the pictures also contain the global 
errors after one period. They allow us to localize the values of 刀 ,which are of 
practical interest. 

We observe that the equistage approximation (6.15) also behaves like O(PA+D) 
when 大 十 lis larger than the order of the integrator. However, due to the increas- 
ing error constants, the accuracy is Improved only for Small Step Sizes. An opti- 
mal 8 could be estimated by checking the decrease of the backward differences 
1V2 2 il. The error of the starting approximation obtained from the continuous 
output behaves like O(js+1) (for the Gauss methods) and, in contrast to the equi- 
Stage approximation, improves with increasing order. The approximations (0.7) of 
order s 十 1ands 二 2areaclearimnprovement. As aconclusion we fnd that for this 
example the equistage approximation (which is free from additional function eval- 
uations) ls preferable only for s = 2 (order 4). For higher order the approximation 
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Step Size 几 
| LU 上 LU | | LU 
103 10” 1071 103 10” 107 103 10” 107 
Fig. 6.1. Errors of starting approximations for Gauss methods as functions of the step Size 
几 : thick dashed lines for the extrapolated continuous output (6.4) and for the approximations 
(6.7) of order s 十 1and s 十 2;thin Solid lines for the equistage approximation (6.13) with 
太一 0,1,...,7;thethick solid line represents the global error of the method after one period 




















10-15 








obtained from (6.7) is Signifticantly more accurate and So it is worthwhile to Spend 
these two additional function evaluations per step. 


VIIH.06.2 Fixed-Point Versus Newton Iteration 


Finally we investigate the iterative Solution of the nonlinear Runge-Kutta System 
(6.1). We discuss fxed-point and Newton-like iterations, and we compare their ef 但- 
ciency to the use of composition methods. 


Fixed-Point Iteration。 This is the most Simple and most natural iteration for the 
solution of (6.1). With any starting approximation 2Z0 from Sect. VIIL.6.1 it reads 


2 (6.10) 


和 1 


In the case where the entries of the Jacobian matrix 大 (yy) are not excessively large 
(nonstiff problems) and that the step Size ls sufficiently Small, this iteration con- 
Verges for 8 一 oo to the solution of (6.1). Usually, the iteration is stopped 这 a 
certain norm of the differences 4 一 ZU is sufficiently small. We then use 2A 
in the update formula (6.2) so that no additional function evaluation is required. 
For a numerical Study of the convergence of this iteration， we consider the 天 e- 
pler problem with eccentricity e = 0.6 and initial values as in the preceding experi- 
ments (period of the solution ls 27). We apply the Gauss methods of order 4, 8, and 
12 with various step Sizes. For the integration Over one period we Show in Iable 6.1 
the total number of function evaluations, the mean number of required iterations per 
step, and the global error at the endpoint of integration. As a Stopping criterion for 
the fxed-point iteration we check whether the norm of the difference of two SuccesS- 
sive approximations is smaller than 10-16 (roundoff unit in double precision). As 
a starting approximation 2Z0 we use (6.15) with = 8 for the method of order 4， 
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Table 6.1. Statistics of Gauss methods (total number of function evaluations,，number of 
fxed-point iterations per step, and the global error at the endpoinb for computations of the 
Kepler problem over one period with e 三 0.6 


Fixed-point iteration (general problems) 

















Gauss 彤 一 2r/25 | 尹 一 27r/50 | 玫 =2r/100 | 万 = 2r/200 | 万 = 2r/400 
803 1043 1393 1825 2319 
order 4 16.1 10.4 学 和 有 4.6 2.9 
9.2.10-? 1.7.10-? 1.3. 10-3 8.4.10-5 5.3 .10-6 
1021 1455 2091 3007 4183 
order 8 9.7 6.8 4.7 3.3 2.1 
1.1.10-3 | 6.9.10-” 3.6. 10-? 在 1 
1297 1731 2311 3441 5917 
order 12 8.3 5.4 3.5 2.5 2.1 
2.7.10-5 | 8.0.10-0 | 2.7.10- 坟 | < roundoff | < roundoff 























and the approximation (6.7) of order s 十 2for the methods of orders 8 and 12. The 
coefficients are those presented after equation (6.12). 

Since the starting approximations are more accurate for Small 几 , the number 
of necessary iterations decreases drastically. In particular, for the 4th order method 
we need about 16 iterations per step for 户 = 2r/25, but at most 2 iterations when 
妨 和 27/800. 开 one is interested in high accuracy computations (e.g., long-time 
Simulations in astronomy), for which the error over one period is not larger than 
10-10, Table 6.1 illustrates that high order methods (2 > 12) are most efficient. 


Newton-Type Iterations。A standard technique for solving nonlinear equations is 
Newton' s method or some modification of it. Writing the nonlinear System (6.1) of 
an implicit Runge-Kutta method as F(2Z) = 0 with 2 = (2 ,2sn)7 ,the 
Newton iteration 1S 


GT 一 22 一 MIPOZG， (6.17) 


where WM is some approximation to the Jacobian matrix 严 /(2Z4). Since the solution 
2 of the nonlinear System is O(P) close to zero, it is common to use AM = 有 (0) 
So that the matrix M is independent of the iteration index K. In our Special Situation 
We get 


M=ITQT-nA4@J (0.18) 


with jy = 大 (加 ). Here, 7 denotes the identity matrix of suitable dimension, and 4 
is the Runge-Kutta matriX. 

We repeat the experiment of Iable 6.1 with modified Newton iterations instead 
of fxed-point iterations. The result is Shown in Table 6.2. We have Suppressed the 
error at the end of the period, because it is the Same as in Table 6.1. As expected, the 
Convergence is faster (le., the number of iterations per Step ls Smaller) so that the 
total number of function evaluations is feduced. However, we do not see in this table 
that we computed at every step the Jacobian 太 (yn) and an 厂 尺 -decomposition of 
the matrix M. Even 让 we exploit the tensor product Structure in (0.18) as explained 
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Table 6.2. Statistics of Gauss methods (total number of function evaluations,， number of 
iterations per step) for computations of the Kepler problem over one period with e = 0.6 


Modified Newton iteration (general problems) 

















Gauss 彤 一 27/125 = 27r/50 | 大 = 2r/100 | 产 = 2r/200 | 疡 = 2r/400 
本 允 和 号 到 本 台大 区 王 呈 要 
本 | 玫 本 琶 瑟 本 区 基 本 本 克 
本 本 大 风 天 医 区 到 本 下 























in Hairer 儿 Wanner (1996， Sect.IV.8), the cpu time ls now consliderably larger. 
Further improvements are possible, if the Jacobian of 三 and hence also the 二 尺 - 
decomposition of M is frozen over acouple of steps. But all these efforts can hardly 
beat (in cpu time) the straightforward fxed-point iterations. In accordance with the 
experience of Sanz-Serna 儿 Calvo (1994, Sect.3.3) we recommend in general the 
Use of fxed-point iterations. 


Separable Systems and Second Order Differential Equations。 Many interesting 
differential equations are of the form 


站 = 7y)， 


For example, the second order differential equation = JW is obtained by putting 
9(7) 一 1.Also Hamiltonian systems with separable Hamiltonian 妃 (p, 9) = 人 (四 十 
V(a) are of the form (6.19). 

For this particular System the Runge-Kutta equations (6.1) become 


2 一 97)， (6.19) 


Ga 一 >》 aijjf( 和 TO =0， Zi 一》 aig(n 二 Gn)=0， 


和 =1 和 1 


In this case we can still do better: instead of the standard fxed-point iteration (0.10) 
we apply a Gauss-Seidel like iteration 


GT 一 六 》 ai 二 2， 


和 1 


ZJ 一 凡 》 aijg(m 十 (和 1)， (6.20) 
JJ 一 1 


Which is explicit for separable Systems (6.19). Notice that the starting approxima- 
tions have to be computed only for Gin. Those for Zir are then obtained by (6.20) 
with 有 十 1=0. 

For second order differential equations 刀 = jy) ,where 9g(7) = ,this iteration 
becomes 


ZL = je 十 有 >》 25 二 2)， (6.21) 


和 1 
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Table 6.3. Statistics of iterations (6.20) for Gauss methods (total number of function evalua- 
tions, number of iterations per step) for computations of the Kepler problem over one period 

















with e 一 0.6 
Fixed-point iteration (Separable problems) 

Gauss 彤 二 27r/125 | 灵 二 27r/150 | 帮 二 2r/100 | 玫 二 2r/200 | 帮 二 27r/400 

437 603 857 1201 1717 
RE 8.7 6.0 4.3 3.0 2.1 

613 923 1427 2339 3647 
9 5.6 4.1 3.1 2.4 1.8 

781 1131 1741 3027 5677 
ed 4.9 3.4 2.6 2.2 2.0 























Where ci 一 区 和 Pi aij and Oij are the entries of the square 42 of the Runge-Kutta 
matrix (any Nystrom method could be applied as welD). Dueto the factor j2 in (6.21) 
We expect this iteration to converge about twice as fast as the Standard fxed-point 
iteration . 

The Kepler problem is a second order differential equation, so that the iteration 
(6.21) can be applied. In analogy to the previous tables we present in Table 6.3 the 
statistics of Such an Implementation of the Gauss methods. We observe that for rel- 
atively large step Sizes the number of iterations required per step in nearly halved 
(compared to Table 6.1). For high accuracy requirements the number of necesSsary 
iterations is SUrprisingly Small, and the question arises whether such an implemen- 
tation can compete with high order explicit composition methods. 


Comparison Between Implicit Runge-Kutta and Composition Methods. We 
consider second order differential equations 交 = jy), so that composition methods 
based on the explicit Stormer-Verlet Scheme can be applied. We use the coe 拍 - 
cients of method (V.3.14) which has turned out to be excellent in the experiments of 
Sect.V.3.2. Itis amethod of order 8 and uses 17 function evaluations per integration 
Step. 

We compare it with the Gauss methods of order 8 and 12 (ie.,s 一 4ands 王 0). 
As a starting approximation for the solution of the nonlinear System (0.1) we use 
(6.7) with mm 三 3, HU1 三 0 Ha 三 二 /3 三 1.75, Hi chosen Such that (0.11) holds 
forK 三 1 .5 十 ?一 1 and hj Suchthat order s 十 2is obtained. Since we are 
concerned with Second order differential equations，we apply the iterations (0.20) 
until the norm of the difference of two successive approximations is below 10-17. 

For both classes of methods we use compensated Summation (Algorithm 3.1)， 
which permits us to reduce rounding errors. For composition methods we apply this 
technique for all updates of the basic integrator. For Runge-Kutta methods，we use 
it for adding the increment to yn and also for computing the sum 》 ;| 久 态 . 

The work-precision diagrams of the comparison are given in Fig. 0.2. The upper 
Pictures correspond to the Kepler problem with e = 0.6 and an integration over 100 
periods; the lower Pictures correspond to the outer Solar System with data given in 
Sect.1.2.4 and an integration Over 500 000 earth days. The left pictures Show the 
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Fig. 6.2. Work-precision diagrams for two problems (Kepler and outer solar System) and 
three numerical integrators (Composition method with coefficients of method (V.3.14) based 
on the explicit Stormer-Verlet Scheme and the Gauss methods of orders 8 and 12) 


Euclidean norm of the error at the end of the integration interval as a function of 
total numbers of function evaluations required for the integration; the pictures to the 
Tight present the Same error as a function of the cpu times (with optimizing compiler 
on a SunBlade 100 workstation). We can draw the following conclusions from this 
experiment: 


e the Implementation of composition methods based on the Stormer-VYerlet Scheme 
ls extremely easy; that of implicit Runge-Kutta methods ls Slightly more involved 
because it requires a Stopping criterion for the fxed-point iterations; 

e the overhead (total cpu tme minus that used for the function evaluations) 1s much 
higher for the implicit Runge-Kutta methods; this ls Seen from the fact that im- 
plicit Runge-Kutta methods requjire less function evaluations for a given accu- 
racy, but often more cpu time; 

e among the two Gauss methods, the higher order method is more eftcient for all 
Precisions of Practical interest; 


VJIIU.7 Exercises 335 


for very accurate computations (Say, in quadruple precision), high order Runge 一 
Kutta methods are more efficient than composition methods; 

much of the computation in the Runge-Kutta code can be done in parallel (e.g.， 
the s function evaluations of a fxed-point iteration); composition methods do not 
have this potential; 

implicit Runge-Kutta methods can be applied to general (non-separable) differ- 
ential equations, and the cost of the Implementation ls at most twice as large; 直 
one ls obliged to use an implicit method as the basic method for composition， 
many advantages of composition methods are lost. 


Both classes of methods (composition and implicit Runge-Kutta) are of interest 


in the geometric integration of differential equations. Each one has its advantages 
and disadvantages. 


Fortran codes of these _ computations are available on the Internet under the 


homepage <http:/www.unige.ch/math/folks/hairer>. A Matlab version of these 
codes ls described in E. & M. Hairer (2003). 


VJIU.7 EXxercises 


1. Consider a one-step method applied to a Hamiltonian System. Give a proba- 


bilistic proof of the property that the error of the numerical Hamiltonian due to 
roundoff grows like O(Vteps). 


. Prove that the collocation polynomial can be written as 


四 -12 Zn) 


where the polynomials pi(b are a solution of 


. Apply your favourite code to the 上 epler problem and to the outer Solar System 


with data as in Fig. 6.2. Plot a work-precision diagram. 

RemaK. Figure 7.1 Shows our results obtained with the 8th order Runge-Kutta 
code Dop853 (Hairer, Ngrsett 儿 Wanner 1993) compared to an 8th order com- 
Position method. Rounding errors are more pronounced for Dop833, because 
compensated Summation ls not applied. Computations on Shorter time inter- 
vals and comparisons of required function evaluations would be more in favour 
for Dop853. It is also of interest to consider high order Runge-Kutta NYystrom 
Imethods. 


4. Consider starting approximations 


丽 三 斩 = 2 ) 人 2 的 王 让”“ 杞 
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Fig. 7.1. Work-precision diagrams for the explicit，variable Step Size Runge-Kutta code 
Dop853 applied to two Problems (Kepler and outer solar System). For a comparison, the 
results of Fig.6.2 for the composition method are included 


which use the internal stages of two consecutive Steps without any additional 
function evaluation. What are the conditions Such that (7.1) ls of order s 十 1, of 


order s 十 22 


Compare the effticiency of these formulas with the algorithms (A) and (B) of 


Sect. VII.0.1. 


Prove that for a second order differential equation = UV) (more precisely， 


fory = 2z,= jy))the application of the s-stage Gauss method gives 


2m 十 1 2m 





4 记 于 工 2 习 


gs 





H 太 >》 记 7 加 十 Go， 


人 


where Zin ls obtained from the lteration (0.21). 
LThe coefficients of the Gauss methods satisfy 2 07a7 一 0i(L 一 ci) for 


all ;7. 


Chapter IX. 
Backward Error Analysis and Structure 
Preservation 


One of the greatest virtues of backward analysis ... is that when it zy 
the appropriate form of analysis it tends to be very markedly superior 
to forward analysis. Invariably in Such cases it has remarkable formal 
Simplicity and gives deep insight into the stability (or lack of ib of the 
algorithm. (J.H. Wilkinson, IMA Bulletin 1980) 


The origin ofbackward error analysis dates back to the work of Wilkinson (1960) in 
numerical linear algebra. For the study of integration methods for ordinary differen- 
tial equations, its Importance was Seen much later. The present chapter ls devoted to 
this theory. It is very useful, when the quajlitative behaviour of numerical methods 
is of interest, and when statements over very long time intervals are needed. The 
formal analysis (construction of the modiftied equation, study of its properties) gives 
already a lot of insight into numerical methods. For a rigorous treatment, the modi- 
fed equation, which is a formal series in powers of the step Size, has to be truncated. 
The error induced by such a truncation, can be made exponentially small, and the 
results remain valid on exponentially long time intervals. 


IX.1 Modified Differential 了 quation -- 玉 Xamples 


Consider an ordinary diffteren- 


tial equation ac oOi(yo) 
e 
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A forward error analysis consists of the study of the errors Vi 一 On(yo) docal error) 
and yn 一 pmpn(yo) (global error in the solution space. The idea of backward error 
analysis is to search for a rodified dijerential eqguation y = 记 (7) ofthe form 


放 = 人 攻 ) 十 成 户 ( 信 ) 十 大户 ( 们 十， (GD 
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Such that yn = V(nA), and in studying the difference of the vector fields jy) and 
矿 ( 人 妇 . This then gives much insight into the qualitative behaviour of the numerical 
Solution and into the global error ‰n 一 V(nP) = WP) 一 V(A). We remark that 
the series in (1.1) usually diversges and that one has to truncate it Suitably. The effect 
of such a truncation will be studied in 9ect. IX.7. For the moment we content OUT- 
Selves with a formal analysis without taking care of convergence lssues. The idea 
of interpreting the numerical solution as the exact Solution of a modiftied equation 
is common to many numerical analysts (“... This is possible since the map is the 
solution of Some physical Hamiltonian problem which, in Some Sense, is close to the 
original problem”, Ruth (1983), or “.. .the Symplectic integrator creates a numeri- 
cal Hamiltonian System that is close to the original . ., GOladman, Duncan 儿 Candy 
1991). A Systematic Study started with the work of Griffhiths 狠 Sanz-Serna (1980)， 
Feng (1991), Sanz-9Serna (1992), Yoshida (1993), Eirola (1993), Fiedler & Scheurle 
(1996), and many others. 

For the computation ofthe modified equation (1.1) weputy := V(b forafixedt 
and we expand the solution of (1.1) into a Taylor series 


区 上 十 站 = 十 凡 (CAOO) 十 六 户 ( 轨 十 天 户 ( 轨 十 
2 (1.2) 
得 ( 人 (十 M 国 二 JUG 二 人 二 





十 


We assume that the numerical method Gj (yy) can be expanded as 
环 ( 人 (一 9 二 7/O 十 大 力士 庆 ds(g 轨 十 … (1.3) 


(the coefficient of Pis jy) for consistent methods). The functions dj(V) are known 
and are typically composed of F(y) and its derivatives,. For the explicit Euler method 
we Simply have dj(y) = 0foral7 >2.Inorderto gety(zj) = Vn for all mn we 
must have Vy(t 十 几 ) = 2(V). Comparing like powers of 岂 in the expressions (1.2) 
and (1.3) yields recurrence relations for the functions 方 (y), namely， 


户 () = dy) 一 责 广 jg) (1.4) 


户 切 = 三 扫 一 页 ( 产 ( 信 力 罗 十 PP) -下 (7 户 轨 + 用 7) 


卫 xample 1.1， Consider the scalar differential equation 
y= 因 ， yY0=1 (1.5) 


With exact solution yy 人 (如 = 1/(1 一 力 . Ithas a singularity at 上 三 1. We apply the 
explicit Euler method yn+i = 加 十 几 F(n) with step Size 几 三 0.02. The picture in 
Fig.1.1 presents the exact Solution (dashed curve) together with the numerical So- 
lution (bullets). The above procedure for the computation of the modifed equation， 
implemented as a Maple program (See Hairer 久 Lubich 2000) gives 


IX.1 Modified Differential Equation - Examples 339 





一 一 一 - exact Solution 


[rr 
一 


20 
[ ”一 Solutions oftruncated / 
modified equations / 














.0 .8 1.0 


Fig. 1.1. Solutions of the modified equation for the problem (1.3) 








六 生疏 2 

六 和 了 63 

> fcoe[1] := fcn(y) : 

> for nn from 2 by 1 to nn qo 

> modqed := Sum(h jxfcoe[j+1l]，j=0..n-2) : 

> Qiffty[0] := Yy: 

> oom 了 by 了 tom Qo 

> qiffy[i] := qiff(dqQqiffy[I-1]vy)xmoded: 
> od : 

> ytilde := sum(nh kxqiffy[Kk]/k!，k=0..n) : 
> res := ytildqe-y-hxrfcn(y) : 

> tay := Convetrt (Series (esrh=0rn+1l)v Polynom) : 
> fcoe[n]l := -coeff (tayrhvn) : 

> od: 

> Simplify(sum(nh jx*xfcoe[j+l]，Jj=0..nn-1))， 


Its output ls 


二 总 人 8 ~ 31 ~ 157 过 
二 


15 





0 (1.0) 


The above picture also presents the solution of the modiftied equation，when trun- 
cated after 1,2,3, and 4 terms. We observe an excellent agreement of the numerical 
solution with the exact Solution of the modified equation. 

A similar program for the implicit midpoint rule (.1.7) computes the modified 
equation 





~2 121I~4 17141-~6 16 11~8 18 3 10. 
三 十 几 了 十 风 8Y 十 风 十 风 1 士 :…， (1.7) 
and for the classical Runge-Kutta method of order 4 (left tableau of (II.1.8)) 


下 ,六 65 ~ 17 ~ 19 ~ 
9 
24 示 576 96 出 144 





站 二 了 2 (1.8) 


We observe that the perturbation terms in the modified equation are of Size 
O(P2P), where Dis the order of the method. This is true in general. 
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Theorem 1.2. SuPpPpose tpat 加 e 1netpod yl 三 于 ( 加 ) 1 oforderD Le， 
2j(g) = oz( 仿 十 PP+l6pH(W) 十 OPT2， 


Wjpeye pi(y) denotes 1pe exactjiom ofy = jg) and PP+IOpH1i(y) tpe 1eading 1em71a 
of1je local 1ru1cation ef1or 7T1e 1Iodijied eqguation 加 e1l Sa1isjiey 


= 的) 十 下 疡 本 (人 十 2 人 访 1a( 人 ) 十 0) = 加 (9) 


mi 万 H1(V) = 0p41(y). 


Proof The construction of the functions 方 (y) (see the beginning of this section) 
shows that 方 (人 =0for2 和 7 和 p 直 andonly 让 Gn(y) 一 en) = O(PPT+DT). 














A first application of the modified equation (1.1) is the existence of an asy111P- 
1ofzc expa118Sio1 of 1jpe 8g8Lopal error. Indeed, by the nonlinear variation of constants 
formula, the difference between its solution y(t and the solution y( 雪 ofy = JW 
Satisfies 

攻 四 一 y 的 = Per 从 十 PPTiepii( 轨 十 (1.10) 


Since n = Vonj) 十 O(AN) for the solution of atruncated modified equation, this 
proves the existence of an asymptotic expansion in powers of 几 for the global error 
甸 一 VC) 

Alarsge part of this chapter studies properties of the modified differential equa- 
tion,， and the question of the extent to which structures (Such as conservation of 
invariants, Hamiltonian structure) in the problem 7 = JW) can carry over to the 
modified equation. 


了 xample 1.3， We next consider the Lotka-Volterra equations 
4=dqD 一 JJ 了 =D2 一 9)， 


and we apply (a) the explicit Euler method, and (b) the symplectic Euler method， 
both with constant step Size 下 = 0.1. The first terms of their modified equations are 








加 《=4p-DD 一 za(2 一 m+DD+O(UD)， 
呈 = 一 p( 一 2) 2 (2 20 一 34 十 和 十 OU )， 
中 4=gp-D 一 2 二 一 各 +DTTO(D)， 


六 一 一 (9 一 2) 十 可 2D( 估 十 D 一 59 十 和 十 OU )， 


Figure 1.2 shows the numerical solutions for initial values indicated by a thick dot. 
In the pictures to the jleft they are embedded in the exact fow of the differential 
equation，whereas in those to the right they are embedded in the How of the modi- 
fied differential equation, truncated after the hz terms. As in the first example，we 
observe an excellent agreement of the numerical solution with the exact Solution of 
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(a) explicit Euler, 玉 三 0.1 












































Fig. 1.2. Numerical solution compared to the exact and modified fows 
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Fig. 1.3. Study of the truncation in the modified equation 


the modiftied equation. For the Symplectic Euler method, the Solutions of the trun- 
cated modified equation are periodic, as ls the case for the unperturped problem 
(Exerclse 5). 

In Fig.1.3 we present the numerical solution and the exact Solution of the mod- 
ified equation, once truncated after the 几 terms (dashed-dotted), and once truncated 
after the jh2 terms (dotted). The exact solution of the problem is included as a solid 
curve. This shows that taking more terms in the modified equation usually Improves 
the agreement of its solution with the numerical approximation of the method. 
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了 xample 1.4. For a linear differential equation with constant coefficients 


二 4V， (0) = 0 


We consider numerical methods which yield yn+i = 忆 (h4)Vy， where 民 (z) is the 
stability function (VI.4.9) of the method. mn this case we get yn = 民 (R4)"%o， So 
that yn = yn 四 ,where ab = 尽 (P4)Wnon 一 exp( 半 In 忆 (P4))yo is the solution 
of the modiftied differential equation 


六 二 Im RU)7=(4+jhpos42 十 724 十 ) 了 7 (G.1D 


with suitable constants ba2, 53,... .Since 忆 (z) = 1+2z 二 O(22) andln(1+Z) 一 2 一 
2Z2/2 十 O(Z3) both have apositive radius of convergence, the series (1.11) converges 
for | 由 < jpo with some jo > 0. We shall see later that this is an exceptional 
Situation. In general, the modified equation ls a formal divergent Series. 


IX.2 Modified 上 Equations of Symmetric Methods 


In this and the following sections we investigate how the structure of the differential 
equation and geometric properties of the method are refected in the modiftied differ- 
ential equation. Here we begin by Studying this question for Symmetric/reversible 
methods. 

Consider a numerical method 殉 p. Recall that its adjoint yn+i 三 于 (yn) is 
defined by the relation yn = 有 _n(Vn+1) (see Deftinition I 工 .1.4). 


Theorem 2.1 (Adjoint Methods). Zel 方 () pe tipe coefjicient jnclions or ijpe 
1lOdijied equation 加 or 1je 1ietjpod 2 Tem 1pe coejicient jinctions 方 () of tpe 
1IOdified eqguation jpr 1 加 e adjoint method G7 Satz 认 


让 网 = 全 0 大 由 (2.1) 
Proof The solution y( 力 of the modifed equation for @G;* has to Satisfy V( 坟 三 
百 _nu 人 (Et 二 加) or equivalently, 7 一 四 =_n() withy := 区 . We get (2.1) 让 
we replace 尺 with 一 岂 in the formulas (1.1), (1.2) and (1.3). 














For symmetric methods we have @7 = @j, implying 户 (y) = 方 (四 We there- 
fore get the following corollary to Theorem 2.1. 


Theorem 2.2 (Symmetric Methods),， 771e coe 太 cient japactions or ije 71aodijied 
egUa1io7 Of a sy1ai111et11ic 71etpod Sa1is 廊 方 (9) = 0 whenever7 ievem so 1paf(7.1) 
1as ad11 eXDGa118Sio7 zeye1 PoWe1y or 7 














This theorem explains the /2-expansion in the modified equation (1.7) of the 
midpoint rule. 

As aconsequence of Theorem 2.2, the asymptotic expansion (1.10) of the global 
error is also in even powers of 几 . This property is responsible for the success of 几 ?- 
extrapolation methods. 
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Consider now anumerical method applied to a p-Teversible differential equation 
as Studied in Sect. V.1. Recall from Theorem V.1.3 that a Symmetric method is P- 
Ieversible under the p-compatibility condition (V.1.4)，which is Satisfied for most 
numerical methods. 


Theorem 2.3 (Reversible Methods). Cozzsider a p-7eversiple di1jjerential eqguatiom 
1 三 Jo aad ap-7reversiple taierical1metpod 辐 (V). 7T1e1s every truicatior ortpe 
1IOdi1jied dijjerential eguatio1 18 08a10 0D-7eversiple. 


Prooj Let 方 () be the Jth coefficient of the modified equation (1.1) for @7. The 
proof is by induction on 7. So assume that for 7 一 1 .7, the vector field 方 (y) is 
D-Teversible, ij.e.， 


po 方 = 三 一 方 。0， 
We Show that the Same relation holds also for 7 = 了 十 1.By assumption, the trun- 
cated modified equation 


区 一 丰 信 ) 十 态 户 ( 信 ) 十 十 斌 方太) 


js p-Teversible, so that by (V.1.2), it has a p-Teversible flow prti(y), thatis, powprt 三 
7 o 0. By construction of the modified equation, we have 


(J = er 十 庆生 万 H( 轨 十 OO) 
Since rn) = 十 O(P),this implies 
2 (0) = 2 人 一 Pi( 仿 十 OU) 


Since both On and rn are D-Teversible maps, these two relations yield p e 方 +1 王 
一 方 +1o D as desired. 














IX.3 Modified Equations of Symplectic Methods 


We now present one of the most important results of this chapter. We consider a 
Hamiltonian system 7 = .JIV 万 (y) with an infinitely differentiable Hamiltonian 
刀 (四 ,and we show that the modified equation of symplectic methods is also Hamil- 
tonian 


IX.3.1 Existence of a Local Modified Hamiltonian 


... 赴 weneglect convergence questions then one can always find a formal 
integral . . . (J. Moser 1968) 


Theorem 3.1. 矿 a symaiplectic 1aietpod Dr(V) M applied 1o da aailiomniamz syste 有 1 
Wiij a s1100 纺 Tailioniazn 万 : 月 24 一 限 ，1jen te modijed eqguation (7.7) 让 
QlSoO Fa1atzlto1lia11. More Pr1eciseiy tpere exist S11100 太 Jactio1sy 万 : 了 24 一 及 jor 


了 三 2,3,. .SUc1 tpat 亡 (W) = .TV 万 i(O. 
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The following proof by induction，whose ideas can be traced back to Moser 
(1908)，was given by Benettin 氏 Giorgilli (1994) and Tang (1994). It can be ex- 
tended to many other Situations. We have already encountered its reverSsible verslion 
in the proof of Theorem 2.3. 


Pioof Assume that 万 (y) = .JIVDi( for7 = 12,. .7r (this is satisfied for 
7 一 1, because 亡 () = jy) = .JJ1IV 万 (四 )). We have to prove the existence of a 
Hamiltonian 瓦 7+1(V). The idea is to consider the truncated modified equation 


多 一 人 ) 十 态 户 ( 信 ) 十 十 万 ()， G.1) 


which is a Hamiltonian system with Hamiltonian 瓦 (J) 十 疡 媚 2() 十 .十 1 一 万 (人 . 
Its fow wrtx(yo), compared to that of (1.1), satisfies 


jp(Vo) 一 orn(2yo) 直 Pr 万 和 1(yo) 加 O(Pr+T2)， 


and also 

















21.(V0) 和 Of (0) 十 Pr 太 (yo) 本 O(PrT2)， 


By our asSumption on the method and by the induction hypothesis，2j and rn 
are Symplectic transformations. This, together with or ji(yo) = 了 十 O(P),therefore 
implies 


了 = 允 (go)77 朴 (po) = 7 十 放 (ao)77+TNNaGo))+OUT9) 


Consequently, the matrix .j 广 (is symmetric and the existence of 万 ;+1(V) Sat- 
isfying 亡 fy) = JIVEDHi( follows from the Integrability Lemma VI.2.7. 
This part of the proof is Similar to that of Theorem VI.2.0. 

















For Hamiltonians 妃 : 忆 一 及 thestatement ofthe above theorem remains valid 
with 万 : 忆 一 及 on domains 刀 C 了 2d on which the Integrability Lemma VI.2.7 
is applicable. This is the case for Simply connected domains 忆 , but not in general 
(See the discussion after the proof of Lemma VI.2.7). 


IX.3.2 Existence of a Global Modified Hamiltonian 


By Lemma VI.5.3 every Symplectic one-step method 有 : (D, 9q) 一 (已 CQ) can be 
locally expressed in terms of a generating function 9( 己 ,q, 几 ) as 


D9 O 
This property allows us to give an independent proof of Theorem 3.1 and in addition 
to show that the modified equation is Hamiltonian with 妃 (p, dg) defined on the same 


domain as the generating function. The following result is mentioned in Benettin 信 
Giorsilli (1994) and in the thesis of Murua (1994), p. 100. 
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Theorem 3.2. 4ssu1ae 妨 at 加 e Sy1iplectic 11zetpod Gh jas ad ge1e1atliag jjactio1 


Wi s11100 纺 95j(P9) dejipred on az opemsetD.Tpem tpemaodifed dijerentialeqgua- 
11o11 18 G 瑟 a11a1l1o11ia12 SySte110 WII 


顾 @ 人 = 万 个 二 丽人 四 十 妥 Ps 人 十 ， G.4) 
Were 态 e jinacfions 五 ji(D,9) ae dejizred and s11100 纺 om 1Pe Whole or 门 . 


Proof By Theorem VI.5.7, the exact solution (已 Q) = ( 广 昌 ,5b) of the Hamil- 
tonian System corresponding to 瓦 (p,q) is given by 


059 95 
=P+ 二 (Pot =0+ 二 (Poi 
也 ， 0 4+ 5 ,9， 让)， 


where 5S is the solution of the Hamilton_Jacobi differential equation 


05 六 05 > 
宛 (P9 昌 = 下 (PP9+ 区 ( 忆 ob) Pd0=0. 
这 (9 ,9+55P9ob) SPo0=0 (G.5) 
Since 万 depends on the parameter 几 , this is also the case for S9. Our aim is to 
determine the functions 万 ij(D, 9) such that the solution 9( 己 qd, of (3.5) coincides 
fort 一 尺 with (3.3). 

We first express 9( 己 ,aq, 划 as a series 


5S(P, 9 t5i(Pd 问 十 刀 5a(P0 六 十 妈 53( 忆 0 内 十 .… ) 


insert it into (3.5) and compare powers of 刀 .This allows us to obtain the functions 
9ji(D, 9q, 几 ) Tecursively in terms of derivatives of 万 : 


91(D， d 站) 和 瑟 (P， dg) 





0 万 55 
2 名 (9 月 = 《有 天)@g 朋 G.9 
> 8 万 699 1162? 万 1951 951 


We then write 9j as a Series 


8(d 站 = Si(D;9g) 十 六 Sa(p 0) 上 7125a(p 人 十 ， 


insert it and the expansion (3.4) for 万 into (3.6), and compare powers of 几 . This 
yields 91k(D,9) = ED,9) andfor7 > 1 weseethat Sik(D,9) is a function of 
derivatives of 妃 ) with / < K. 
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The requirement 9(P, q, 几 ) 三 S(p,9 有 站 finally shows 91(Pp,dq) 三 Sil(p， d)， 
592(D, 9) = 912(D,9) 十 921(D,9),etc., so that 
9j(D,9) = 万 j(D,9) 二 "function of derivatives of 瑟 (D,9) with 开 < 广 . 


For a given generating function 9( 忆 9, 力 ,this recurrence relation allows us to de- 
termine Successively the 瑟 ;(p, q). We see from these explicit formulas that the func- 
tions 九 ) are defined on the same domain as the 97， 














As a consedquence of Theorem 3.2 and Theorems VI.5.4 and VI.5.3 we obtain 
the following result. 


Theorem 3.3. 4 sy1aPlectic (Pa1Litio1zed) Runge-Kxiia 11etpod apPplied io Q SySste11 
Wiip soo 态 Fanilfoniaz 万 :万 一 陈 (wi 太 DCRR2d an arpitary open set) jasya 
1iOCljied 本 ailiomia1 (3. 和 wii soo0 太 Jpactio1s 万 ) :万 一 下 . 














了 上 xample 3.4 (Symplectic Euler Method). The symplectic Euler method is noth- 
ing other than (3.2) with (Paq,P) = 几 万 ( 忆 g). We therefore have (3.3) with 
91(D,9) 三 瓦 (D,9) and 9j(D,9) =0for7 > 1.Following the constructive proof of 
Theorem 3.2 we obtain 


~ 用 几 2 2 2 

下 = 如 -了 且 瑟 十 五 (本 形 + 丽 g 形 +4B 丁 本 ) + 6 
as the modified Hamiltonian of the Symplectic Euler method. For vector-valued 7 
and dq, the expression 忆 po is the scalar product of the vectors Zn and 万 and 
也 pp 克 2 三 瓦 pp( 瓦 万) with the second derivative interpreted as a bilinear map- 
ping. 

As a particular example consider the pendulum problem (1I.1.13)，which is 
Hamiltonian with 瑟 (p, g) = 22/2 -- cos gq, and apply the symplectic Euler method. 
By (3.7), the modifed Hamiltonian ils 


2 
万 (pp,dq) = 瓦 (P,dq) 一 psing+ 五 (sin2g+ 瑚 cosg) 十 .... 

This example illustrates that the modified equation corresponding to a Separable 
Hamiltonian (ie., 歹 (p,9) = 了 (D) 十 V(q)) is in general not separable. Moreover， 
it Shows that the modified equation of a second order differential equation 4 王 
--VV(a) (or equivalently, d = D)D = --VU(d)) is in general not a second order 
equation. 


In principle, the constructive proof of Theorem 3.2 allows us to explicitly com- 
pute the modified equation of every Symplectic (partitioned) Runge-Kutta method. 
In Sect.IX.9.3 below we Shall, hpowever， give explicit formulas for the modified 
也 amiltonian in terms of trees. This also yields an alternative proof of Theorem 3.3. 
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JIX.3.3 Poisson Integrators 
Consider a Poisson System, ji.e., a differential equation 
二 JJ)VYE()， (G3.8) 


Where the structure matrix 成 (V) satisfies the conditions of Lemma VIIL.2.3,， and 
apply a Poisson integrator (Definition VII.4.0). 


Theorem 3.S. Ja Poissom integrator 2G1h(y) 1 applied io 1pe Poisson syste11a (3.8) 
1je1 1je 1iodijied eqguation 1 1ocaly a Poisso1 8Syste11. Mo1e pecise 凡 Jor eve 妆 y 
2V0 E 慌 ” 加 ere exisf Gd 1eigHpDourpood LV ad s1100 太 Jpctio11g 万 ) :LU 一 下 sucp 
1jat o1 L 1je 11odijied eqguation 18 of ipe jormm 


六 = BO (YU 十 帮 立 用 人 太 十 12Y 有 太太 十 .， 站 (3.9) 


Pioof We use the local change of coordinates (w,c) = X(V) of the Darboux-Lie 
Theorem. By Corollary VJ.3.6, this transforms (3.8) to 


飞 二 .JIVuwK(wo)， ce=0， 


Where 开 (u; c) 三 万 (y) and Vu is the gradient with respect to wu. The same transfor- 
mation takes @Gjn(y) to XogBpnox-1(0c) = (于 (ccl,wherebyLemmaVIL4.10 
ur 2(u cj is asymplectic transformation for every c. By Theorem 3.1, the mod- 
这 ed equation in the (, c) variables is of the form 


= JIVE( 人 日 ， =0 


with 民 (u ce) = 天 (we 十 彤 Ka(uc) 十 12Ka(uc) 十. Transforming backto 
the V-Variables gives the modiftied equation (3.9) with 五 ;(y) = 开 i(u c). 














The above result is purely local in that it relies on the local transformation of the 
Darpoux-Lie Theorem. It can be made more global under additional conditions on 
the differential equation. 


Theorem 3.6. 矿 万 (V) and 忆 () are dejiped aznd soo 纺 on da SP1y co1z1aected 
doaar 万 and 入 B(O) RN ivertiple on 万 1jen a Poissom integrator Gh(y) jas 4 
1I0dijied equatiom (3.9) wii sootp Jinactions 万 ji() dejized om all or 万. 


Proof By the construction of Sect. IX.1, the coefficient functions 方 (y) ofthe mod- 
过 ed equation (1.1) are defned and Smooth on 刀 . Since 她 (y) is assumed invertible， 
there exist unique Smooth functions 9gj(y) such that 方 (y) = 刀 (y)9j(y). Itremains 
to show that gj() =VDi(V) for a function 五 )(y) defined on 万 . 

By the local result of Theorem 3.3, we know that for every yo E 也 there exist 
functions 万 7 (9) Such that 9j(V) 三 VE (y) in aneighbourhood of yo. This implies 
that the Jacobian of 9gj(y) is symmetric on 刀 . The Integrability Lemma VI.2.7 thus 
Proves the existence of functions 五 )(V) defined on all of 刀 such that 9j(y) = 三 
V 古 人， 














348 IX. Backward Error Analysis and Structure Preservation 


IX.4 Modified Equations of Splitting Methods 
For splitting methods applied to a differential equation 


= j0)+7D(O)， (4.1) 


the modified differential equation is obtained directly with the calculus of Lie deriv- 
atives and the Baker-Campbell-Hausdorfft formula. This approach is due to Yoshida 
(1993) who considered the case of separable Hamiltonian Systems. 


First-Order Splitting，Consider the splitting method 
大 二 gl oo 
where op 辣 is the time- 六 fow ofy = 上 加 (四 ). In terms of the Lie derivatives 也; 
deftined by Dig(y) = 9 ( 力 Jj 国 (四 ,this method becomes, using Lemma IIL.5.1， 
2 到 三 exp(hDo)exp(ADi)Id， 
and with the BCH formula (ULI.4.11), (HI.4.12) this reads 
2 = exp(PD)Id 
with 
太 = 访 十 芒 十 [Da DJ] 二 已 ([Da[Da DIDiIPu DJ) 二 Ga 


It follows that Cn is formally the exact time- 几 fow of the modified equation 


三 = 7 信 with 了 二 万 Id. (4.3) 


This gives 缮 
jg) = 7g 十 户 (加 二 大户 (力士 
with = JP and 


疹 : 汪 3 人 77 轨 /四 7) 
方 = 瑟 (7 (7 四, 7) 下 7 7 7D 7 加 (7D， FD]) 2 7 加 7 7DI 
上 + (fm,FD)+ 上 + AD- (FJD) 一 7 7 四 AD)， 





Strang Splitting，For the Symmetric splitting 


S 1 2 1 
BCpja ooP opRN2 


the Symmetric BCH formula (HUUI.4.14), GUI.4.153) yields 
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6 和 exp( 二 D) exp(AD>) exp( 二 D) Id = exp(PDB)Id 


WwWith 
渤 


方 I] _ 21 
方 = Di+Da+PB( 六 


[Du[Du Dal+ 囊 [Da,[Da, DJ) + (4 


Hence， GD is the formally exact How of the modiftied equation 
= Je 人 with jg = DBId. (4.5) 


This gives 
杀人 = 7 人 + 民有 (全 二 用 让 ( 乓 十 … 
with 上 = JD 二 FDI and 


中 至 (天 (四 “7 四) 兴 7 7 7 7 加 (fm ,7 加 ) FOOD) 
1 (7 加 (7FD, 7D) 站 7 加 70 7DI 壮 70“(7 四 ，FD) 本 7 7 四 AD) 


24 
The modified equations for general splitting methods (HI.3.13) are obtained in the 
Same way, Using Lemma II.3.3. 


Hamiltonian Splittings. Consider a differential equation (4.1) where the vector 
fields /加 (y) = JIVTFO(N are Hamiltonian. Lemma VIIL3.1 shows that the com- 
mutator of the Lie derivatives of two Hamiltonian vector fields is the Lie derivative 
of another Hamiltonian vector field which corresponds to the Poisson bracket of the 
two Hamiltonians: [DP, De] = Dic 下. This implies in particular that the modi- 
fied differential equations (4.3) and (4.5) are again Hamiltonian. For the first-order 
splitting, we thus get 亡 (J) = .JIV 万 (四 ,where by (4.2) and (4.3)， 


肋 - aa 


五 = 下 ({E 四 ,本 加] +{{ 且 加, 瑟 吕 }, ED) )， 
and for the Strang Splitting, by (4.4) and (4.3)， 
BR = - 吉 { 瑟 加, P], 瑟 ] - 巧合, 瑟 由 }, 鼠 加]}， 


The explicit expressions from the BCH-formula show that the modified Hamiltonian 
is defined on the same open set as the smooth Hamiltonians 万 加 . 

For the splitting 互 (z,9q) = 工 (p) + UV(q) of a separable Hamiltonian, this ap- 
proach gives an alternative derivation of the modified equation (3.7) of the Sym- 
plectic Euler method, and a Simple construction of the modified equation of the 
Stormer-Verlet method (Yoshida 1993). Here, the formula simplifies to 


交 工 1 
启 B = 吾 + 岂 (-- 击 Do(T 切 上 + 二 De) + 46 
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IX.S$ Modified 上 Equations of Methods on Manifolds 


We consider the relationship between numerical methods for differential equations 
on manifolds and the associated modified differential equations. We give appli- 
cations to the study of first integrals，constrained Hamiltonian Systems,， and Lie 一 
Poisson integrators. 


IX.S.1 Methods on Manifolds and First Integrals 


Consider a differential edquation on a Smooth manifold A4， 


二 JJ) with jy) ET G.1D 
with a smooth vector field /yy) defined on 人 4， 


Theorem S.1. Zer 2 : 人 一 Ape an integjator on 妨 e 11a1 加 1 AL mwi 态 环 (W) 
depe1dijz8g S110otp1y oz (2 几 ). Te tpere exisf Ga 11odijiea dpierenfial equation 
O1 人 4， 
“= 人 十 六 户 们 士 生 记 人) 十 … (5.2) 
Wi s11oo 术 万 () E 克 A sucp tpatorpny) 三 开 () 十 Or where rt 
de1totes 1pe Jiomw of tpe 1runcatioz of (5.2) qierr 1e11118. 
FoDr sy1z11etric 11etjpods, 1pe expDa118io11 (5.2) co11ai18 oly eve1l PoWe1s or 六. 


Pioof We choose a local parametrization y = X(z) of the manifold 人 4. mn the 
coordinates z the differential equation (9.1) reads 


一 下 (2) with 天 (z) definedby  X( 2 已 (z) 下 X(2))， 
and the numerical integrator becomes 
琴 (2) 一 XTLoGhox(2， 


Since 已 (z) and 叶 (z) are smooth, the standard backward error analysis on 琢 "” of 
Sect.IX.1 yields a modified equation for the integrator 万 (2)， 


之 = 下 (2) 十 PE( 2 十 2 丙 ( 轨 十 .… . 


Deftining 
方 (人 =X 有 (人 for y=X() 
gives the desired vector fields 方 (V) on 人 4. It follows from the uniqueness of the 
modified equation in the parameter space that 方 (y) is independent of the choice of 
the local parametrization. 
The additional statement on Symmetric methods follows from Theorem 2.2, be- 
cause 几 人 Symmetric 过 and only 让 2h ls Symmetric. 














Under an analyticity asSumption, the converse statement also holds. 
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Theorem S.2， Zetl 1Pe ifegfator Oh :LU 一 开 ” (wiiuopen 7 C 了 ") pe 7eal 
G1G1yfic 太太 ad ef 人 三 1yEUi9O) =0 wirealanayticg :一 及 了. 
矿 1pe coefjicient Jincfions 方 (y) of ipe maodijied dijerential eqguationm (5.2) Satis 放 
9 人) 方 (加 三 0Jjoral7andalyeE 人 1 加 en tpe restriction ofGh io 人 dejines a1n 
171eg1ator Oo1 人 4 1 e, Oh :4 一 人 41. 


Proof By the assumption on 方 (y), the fow of the truncated modiftied equation 
satisfies go prn(y) =0foralr 之 1andalyeA 人 ASince prn(y) = 环 (y) 十 
O(Pr+l), we have go (0) = O(r+l) for all >. The analyticity assumptions 
therefore imply go Gjp(V) = 0. 














Theorems 5.1 and 3.2 apply to many Situations treated in Chap.IV. 


First Integrals. The following result was obtained by Gonzalez, Higham 儿 Stuart 
(1999) and Reich (1999) with different arguments. 


Corollary $.3. Comsider adijerential equation = FU) wii astiptegralT(y)， 
e, (JJ 三 0jJorall yy 和 矿 tpe aaerical 1aietpod pyeservesy 切 18 Jst integ1Q1 
态 e1a eyey trication ortpe 11odified equation jasy 了 TV) as ayst Integ7aL 


Poof This follows from Theorem 5.1 by considering wy = jy) as a differential 
equation on the manifold A4 = {y; T(y) = Comnstj, for which the tangent space is 
mAM= {foiFl)o= 0 














The following converse of Corollary 5.3 is a direct consequence of Theorem 3.2. 


Corollary $.4. Comsider a dijerential equation = Jo) mw 训 a 7eal-aaalytic jzst 
ifeg1al 7T(). 矿 je aaerical 1aetpod 于 (V) 入 Jeal analytic 太太 aad 矿 evemy 
1rz1acatio1 of tpe 11odified eqguation pay TV) as ast integ1a1 1Pez tpe 11aerical 
1zetpod preserves T(y) exacty Le, IT(@1( 人 ) = 了 TV)Joral yy 














Projection Methods. Algorithm IV.4.2 defines a smooth mapping on the manifold 
让 the direction of projection depends smoothly on the position. This ls Satisfied 
by orthogonal projection, but is not fulfilled 让 Switching coordinate projections are 
used (as in Example 4.3). The symmetric orthogonal projection method of Algo- 
rithm V.4.1 gives a Symmetric method on the manifold to which Theorem 5.1 can 
be applied. 


Methods Based on Local Coordinates. If the parametrization of the manifold em- 
ployed in Algorithms IV.5.3 and V.4.3 depends smoothly on the position, then again 
Theorem 5.1 applies. This ls the case for the tangent Space parametrization, but not 
for the generalized coordinate partitioning considered in Sect.IV.5.3. 


Corollary S.S (Lie Group Methods). Cozzsider adi1jerential equation o1 G 11IG1FIX 
Zie 810OUD @， 
Y= 4(Y) 


Where 4() 站 太 1e associated Lie algep1a g 4ZLiegroxp integratorGh:G 一 G 
jas 1jpe 1iodijied eqguatiom 
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艺 = (4( 区 +j4a( 人 十 124s( 信 十 .这 (5.3) 
wii 4) sg JporyecG. 


Poof This is a direct consequence of Theorem S$.1 and (IV.6.3)，Vviz.，TyYCG 王 
{4Y7l4egl. 














JIX.S.2 Constrained Hamiltonian Systems 


In Sect.VIIL.1 we studied symplectic numerical integrators for constrained Hamil- 
tonian Systems 


9 “ 五 p(PD,9) 
六 = 一 囊 ( 人 9) 一 G(O7 和 (5.4) 
0 = 9(9). 


Assuming the regularity condition (VIL1.13), the Lagrange parameter 入 王 A(D, 9) 
is given by (VJH.1.12). This System can be interpreted as a differential equation on 
the manifold 


AM={Cg)19(0) =0 G(OB(D9) =0}， (5.5) 


where G(9) = 9(q). The symplectic Euler method (VI.1.19)-(VIIL.1.20), the RAT- 
TLE Scheme (VJI.1.26), and the Lobatto IIA-IUIB pair (VIH.1.27)-(VYIH.1.30) were 
found to be symplectic integrators 2n on the manifold 人 4. 


Theorem 5$.6. 4 sy1aipliectic ipfegfalor 2 : 人 一 人 Jor1pe comnstrained 瑟 a11211- 
1O111G1 syste111 (2. 有 1as a 7111odijed eqguation which is localy or ipe jomm 


站 = 五 5(D,9) 
万 三 一 瓦 ( 访 们 一 GD (3.0) 
0 = 9(9)， 


where 入 一 入 (六 gzgmvenpy(YVE7112) mi 吏 万 7eplaced py 万 and 
万 人) = 万 2 人) 二 天 (人 人 十 尼 及 (人 四 十，… G.7) 
WiU 五 j(D;,9) satishing G(q)VpEi(D9) 三 0Jor(D9) EA 人 anda117. 


Pioof As explained in Example VJIIL2.7, a local parametrization (DP,q) = X(z) of 
the manifold A4 transforms ($.4) to the Poisson System 


必 一 巨 (z)V 开 (2 (3.8) 
with 刀 (z) = (X(z)77JX(z))-L and 开 (z) = 瑟 (X( 2). Lemma VIL4.9 implies 


that the numerical method Gjn(D, 9) on 人 becomes a Poisson integrator 内,(z) for 
(5.8). By Theorem 3.5, 败 (z) has the modified equation 


六 = 妃 ( 习 (YKO 二 PRVFKo 习 十 12VFK3O 习 十 . 小 (5.9) 
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Let rr be asmooth projection onto the manifold 人 4, defined on a neighbourhood of 
人 ip 琢 24. We then define 


厅 ( 人 9g) = 0) 二 APG(OJV5 瑟 (pg9) 


Where we choose HA(D,q) such that 
COVpEji 9)=0 for (9 EA4. (5.10) 


This ls possible because of the regularity asSsumption (VI.1.13)，and because 
G(q)Vp(p,9) = 0 on 人 .The condition (5.10) implies that the system (3.6) can 
be viewed as a differential equation on the original manifold A4. Using the same 
parametrization (D, g) 王 X(z) as before shows that (5.6) is equivalent to (3S.9). 














We note that，due to the arbitrary choice of the projection T，the functions 
五 i(p,dq) of the modiftied equation are uniquely defined only on 人 4. 


Global Modified Hamiltonian. If we restrict our considerations to partitioned 
Runge-Kutta methods, it is possible to find 万 ;(p, 9g) in (5.7) that are globally de- 
fined on 人 4. Such a result is proved by Reich (1996a) and by Hairer 久 WannerT 
(1996) for the constrained Symplectic Euler method and the RATTLE algorithm, and 
by Hairer (2003) for general symplectic partitioned Runge-Kutta Schemes. We fol- 
low the approach of the latter publication，but present the result only for the im- 
portant Special case of the RATTLE algorithm (YI.1.26). The construction of the 
五 ij(p,d) is done in the following three steps. 


teDp 17. 9y11DUectic EXte1lsio11 of 1jpe Metjpod 1o da NeigHpoxjpood or 1pe Ma1zt 加 以 
The numerical solution (zi, qi) of (VIL.1.26) is well-deftined only for initial values 
satisfying (po, qo) E 人 4. However, 这 we replace the condition “g(ql) = 0”by 


9g(q1) = 9g(9qo) 十 尹 G(qo) 瑟 (Do, 9o)， (3.11) 
and the condition“G(dqi) 瑟 (pl,9qi) = 02”by 
C(q)Ep(p191) = G(qo)Ep(po, go)， (3.12) 


then the numerical solution is well-defined for all (2o, qo) in an A-independent open 
neighbourhood of A4 (cf. the existence and uniqueness proof of Sect. VII.1.3). Un- 
fortunately, the So-obtained extension of (VJIIL.1.26) ls not Symplectic. 

JInspired by the formula of Lasagni for the generating function of (uncon- 
Strained) Symplectic Runge-Kutta methods (See Sect. VI.3.2), We let 


尺 
SOum 崩 = 了 (Euyog) 十 吾 oayzg)+gGo)7A+g(oD)TA) (6.19) 
六 2 


了 
富 开 ( 函 (puyag) 十 G(oJrn] (本 np， d0) 十 孙 (pjg))， 


Where po, pl/2,D1, 90,9q1, 和 Nare the values of the above extension. In the defini- 
tion (5.13) of the generating function We consider po, pl/2,91, 入 Nas functions of 
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(pl1,q0)，what is possible because pl = po 十 O( 由 .With the help of 9(P,q,P) we 
define a new numerical method on a neighbourhood of A4 by 


Do = Dll 十 9v(P1, go 7)， dl1 三 900 十 Snp(D1,9o, 几 ). (S.14) 


This method 1is Symplectic by definition， and it also coincides with the RATTLE 
algorithm on the manifold A4. Using the fact that the last expression in (3.13) equals 
(D1 一 Di/2)7 (di 一 qoj, this is seen by the same computation as in the proof of 
Theorem VI.3.4. 


Step 2. 4Dplication oftpe Resulis ofsect .了 .2. The function 9(pl, 9qo, 几 ) of (5.13) 
can be expanded into powers of 凡 with coefficients depending on (pl1,dqo). These 
coefficient functions are composed of derivatives of 万 (p,9) and 9qg(dq) and, conse- 
quently, they are globally deftined. For example, the 尹 -coefficient is 


S1(D1, go) = 歼 (pl,go) 十 g(qo)7XA(Di go)， (5.15) 


where X(D,dq) is the function defined in (VII.1.12). 

We are thus exactly in the Situation，where we can apply Theorem 3.2. This 
proves that the method (5.14) has a modified differential equation with globally 
defined modified Hamiltonian 


In particular, the constructive proof of Theorem 3.2 shows that 五 1(D, q) = 51(D,9) 
with 91 (pz, q) from (5.15). 


tep 了 Backi1nte1PD1elation Jor 1jpe Metpod om 1je Ma 加 1 Since the RATTLE al- 
gorithm deftines a one-step method on 人 4, it follows from Theorem 5.1 that every 
truncation of the modified differential equation 


方 = 一 Vv 万 (六 并 = Vp 末 (六 了 (5.17) 


is a differential equation on the manifold A4. Terms of the form 9(d)7w(p;, 9) ip 
万 se(D， d),，which vanish on AM, give rise to -9g(0J7 uv(p;9) - G(O7A(p ao) and 
9g(q)7wpp(p,9) in the vector field of (5.17). On the manifold 人 MM，where 9(q) = 
0, only the expression 一 G(g)7(p,9) remains. Consequently，we can arbitrarily 
remove terms of the form 9g(q)77(p;,9) from the functions 万 ;(p,9) in (5.16), 让 
we add aterm -G(q7Ain the differential equation for p with 入 defined by the 
relation g(9) = 0. This then gives a problem of the form (3.6) with globally deftined 


五 j(p, 9)， 


JIX.S.3 Lie-Poisson Integrators 
As in Sect.VJIIL.3.3 we consider a Symplectic integrator 


LPG = 下,Qo) on 7TCG 
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for the left-invariant Hamiltonian system (VIIL.3.43) on a matrix Lie group C with a 
Hamiltonian 丈 (已 @) that is quadratic in 尸 . We suppose that the method preserves 
the left-invariance (VJI.3.54) so that it induces a one-step map 


2 一 夸 (Yo) On 


by setting 交 = QTP for ( 己 ,Qi) = 于 (PQo) with Q1 而 = 区.Thisis a 
numerical integrator for the differential equation (VIL.3.37) on g ,and in the coor- 
dinates y = (j) with respect to the basis (7) of g” this gives a map 


J1 三 yo) on 及.， 
which is a numerical integrator for the Lie-Poisson system 7 = 已 ()V 瓦 (V) with 
已 (V) given by (VII.5.35). 


Theorem S.7. 矿 辐 (已 Q@)iasyPplectic ad 1eHji-iavariantinategratorjor(Y75.43) 
Wicp is 7eal aaatytic 太 见 态 en 这 Teduction (UV) 国 a Poissom integrator Mo7e- 
oveF 欧 ( 了 ) Preserves 1jpe coadjoiat orpit ie 不 (了 ) <s{Ady yieCGl. 


Proojf (a) In the first step one Shows, by the standard induction argument as in the 
proof of Theorem 2.3, that the modified equation given by Theorem 5.0， 


六 = -ve 加 - 》 Nivog(O， 0= ve 到 g 
1 一 1 ($.18) 
0 本 0i(G@)， 1 三 |， ) 77， 
WwWith 本 
刀 (PG) = 万 (PQ@)+TAhE2(PQ)+12Bs(PQ) 二 .…. 
ls left-invariant, ij.e.， 
万 (0 忆 TIOQ) 三 五 1(PQ) foralV edGandall7. (5.19) 


(b) The Lie-Poisson reduction of Theorem VJII.5.8 yields that 这 (已 (， QO) GE 
T*G is a solution of the modified system (5$.18), then 立 介 = G(OTB(D e 旦 
Solves the differential equation 


( 芭 , X) = 人 [ 序 (区 foral Xecg (5.20) 


Theorem VJI.3.6 Shows that its solution lies on acoadjoint orbit. By Theorem VI.3.5， 
(3.20) is equivalent to the Poisson System 


= 有力 V 万 人力 . (5.21) 


(c) We know already from Theorem VJIIL.S.11 that WwW)(y) is aPoisson map. Since 
all truncations of the modified equation (3.21) have the Casimirs as first integrals， 
their preservation by Wn follows from Corollary 5.4. Similarly, the Preservation of 
the coadjoint orbits follows from Theorem 5.2. 














In contrast to Theorem 3.3，we here obtain a global modiftied Hamiltonian in 
the modified Poisson System 过 the method is obtained by the discrete Lie-Poisson 
reduction of the RATTLE algorithmi; see the preceding Subsection. 
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IX.6 Modified 上 Equations for Variable Step Sizes 


The modified differential equation of a numerical integrator depends on the step 
Size employed. Therefore, ifthe step Size ls changed arbitrarily, a different modified 
equation occuIs at every Step. This ls the reason for the poor longtime behaviour 
observed in Sect.VII.1. On the other hand, a satisfactory backward error analysis 
is possible for the variable-step approaches of Sects.VJIU.2 and VIIUL.3. 


Time Transformations. The adaptive approaches of Sect. VII.2 amount to apply- 
ing afxed step Size method to atransformed differential equation. Hence, the back- 
ward error analysis conslidered so far applies directly and yields modified equations 
for the transformed problem. These modifed equations are Hamiltonian for Algo- 
Tithm VJIU.2.1 and reverslible for method (VIU.2.12). 


Proportional，Reversible Step Size Controllers. As in Sect. VII.3.1 we let the 
Step Size be of the form 
jn 二 172 一 ESs(Wns)， (60.1) 


where s ls a Small accuracy parameter. It is not allowed to use information from 
previous steps. The idea ls to work with expansions in powers of the fxed parameter 
sinstead ofthe step Sizes, and to consider the exact solution of the modified equation 
on a variable grid. The following development is given in Hairer 多 Stoffer (1997). 
It extends the results of Sects.IX.1 and IX.2 to variable step Sizes. 


Theorem 6.1. Zef Gpn(y) pe ad s11oo1t1 one-sfep 111e 加 oQ. 
夫 Tje variable-step 1ietjod y 情 Eee) asa7aodijied derential egua- 
1O7 


Y = j 信 十 < 户 ( 押 十 全 户 ( 信 十 … ， (6.2) 
Wi sz1100 态 vectorjielads 方 (y)， sc tat 
me sc)(9) = 9 s(V;E) (9) O( 人 er ) ) (0.3) 


Wje1e rt(y) denotes 1pe Jow oftipe truncation or(6.2) afier7 te11115. 
中 不 ie maetpod 站 Sy1amaetric (ie By) = (人 ) and s( 办 一) = (hs) 
jpolds wi 态 攻 = Esc)()， 芒 en tje extpa11sion (6.2) 1 六 eye Powe1sy of ee， 


方 ( 轨 =0 Jor eve1 7 (6.4) 


cj 1ripemmaetpodizs p-reversiplefie ,po 辐 一 25 opjands(p-1I 办 =) 一 s(e) 
jpolds wm 六 攻 = Besoyc)() ipen tpe7aodiied equation (6.2) 88 10-7eversible Le， 


po 方 = 一 万 op Jor a117. (0.3) 


Prooj al) The modified equation (6.2) is constructed by Taylor expansion of (0.3) 
in the same way as (1.1), using s-expansions instead of 凡 -expansions， 

For the proof of the statements (b) and (c) we denote, as we did in Sect. VJII.3， 
死 () = esoy eV). We then compute the dominant error term in (6.3) and obtain 
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玫 (O) 二 Oriesty ec)() 十 cs(0 E) 亡 +H10) 十 O(erf). (0.0) 


With the aim of getting an analogous formula for 到 1, we put 分 = 于 (y) and use 
or 一 pr ti(y) so that 


V 二 W%r, 一 es(Ve) (分 一 Er s(， 5) 方 +1(V) 相 O(c 一 ))， (0.7) 


b) Inserting s(y,s) = 5( 办 一 s) into (6.7) and using the facts thaty = 也 十 O(s) 
and that the derivative fi(V) is O(-close to the identity, we obtain 


下 (二 = = Jr-esG@-a 办 -ce (办 0) 太 1 力 +Oc ) 6.9) 


By (VJHI.3.3) we have 凤 = 三 允 -. Changing the Sign of s in (6.8), a comparison 
WwWith (6.6) proves that 户 +1(y) = (-1)7 记 51(V) implying (6.4). 
c) With s(y, es) = s(p-! 人 s) formula (6.7) yields 


Z (办 ‰%m, 一 es(p 一 11e) ( 纺 cr+1s(p 从 0) 广 +1( 娘 十 O(e 2). 


By an induction argument on 7 we asSsume that Do rt 三 %r -to 0. The p- 
Teversibility of 玫 ,iepo 匹 = 玖 opthusimplies the statement (6.5). 














JIntegrating，Reversible Step Size Controllers. We next study a backward error 
analysis for Algorithm VIIU.3.4. It is possible to interpret this algorithm as the fxed 
Step Size method G of (YIU.3.19) applied to the augmented System (VJIIU.3.17) and 
to apply the construction of Sect. IX.1. This approach has been taken in Hairer 公 
Soderlind (2004). In view of an error analysis for reversible integrable Systems 
Seemas to be more convenient to consider the Solution of the modifed equation on a 
variable grid as it is done in Theorem 6.1. 

Let us recall Algorithm VII.3.4. For a given basic integrator @Gjp(Vy) and a given 


time transformation c(y) we denote G(V) = 一 (c(J)) 一 Von) and we com- 
pute for a given initial value yo and with z0 = 1/c(yo) 
2m 十 1/2 “人 加 十 EG(n)/2 
Zn+l 一 Ze za (on) (0.9) 
2+1 二 2+1/2 十 EG(n+1)/2. 





The values yn approximate y( 妇 ), where 如 +1 三 如 十 /zl1/2. We further use the 


notation 
(yn Vmn 十 1 ~ (2 0 
| and (1 (6.10) 


The step Size used in this algorithm is 








工 
一 ss(Wn22,E) with ss(yz,E) 一 (6.11) 


及 三 ne 
全 2 十 1/2 2 十 EcG(V)/2 


The Symmetric definition of the algorithm immediately yields 
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5 办 力 一 5) 二 5(y， 忆 ， 5) for (办 必 ) 和 匹 (y， 2 (0.12) 


For a p-reversible differential equation y = jy) and for o(y) satisfying o(por-1y) 一 
aly) wehave G(p-1Ly) = -G(W). Consequently, the step Size function s(y, >, =) of 
(6.11) also satisfies 


s(pr- 欧 信 -ej=s(y ze for 众人 = 下 (四 (6.13) 
With this preparation we are able to formulate the following result. 


Theorem 6.2. ZLef GBP (V) pe a s11oo 态 ole-step 111etpod al(y) as1ioo 态 11e 1ra118- 
Paatiom da11d s(V 2,E) 1pe step size jzactio of(6.7 人 7 
Cj Fortipe 1ietpiodg or(6.710) 1jere exis18 QG1iodijied di1erential eqguatiom 


六 = 7 加 上 +e 户 作为 上 +e2 户 六 习 十 
2 = >G( 信 +eG2z( 信 习 十 e2Gs( 世 习 十 …… ， 
Wi s110o 太 vectorjielas 方 (0 2，GjV 2)，SUCcP ta 
prestyzel(0 2 三 玫 ( 人 十 O(e)， (6.15) 


Wje1e prti(y 2) denotesthe jiow ortpe truncation ofipe syste1af6.14) dfier7 te11105. 
吧 太 1pe pasic 1metpod jy syma1aetric (ee En(y) 一 5 (人 ) Pen 


万 ( 轨 =0 Joreven 7 (6.10) 


CJ Tipepasicmaetpodisp-reyrersiblefie,po 思 = 国 Lopjandc(po-ly) = 
jolds, 加 en 1jpe 1podijied equation (6.74) 1 DO-7eversiple mw 了 PPgive py (6.70) Le,， 


p 广 (2) = 三 一方 [00 2， Gj 思 2 三 一 GO 2 Jorall7. (6.17) 


Prooj The proof is the Same as for Theorem 6.1 and therefore omitted. Notice that 
the step Size function Satisfies (6.12) and (6.13) which are needed in that proof. 


(6.14) 














H 开 the basic method is of order p then the coeffticient functions of (6.14) Satisfy 
万 (2) =0for7y 三 2,...,D. Wealways have Ga(yz) = 0 dueto the symmetric 
Way of choosing 2zr+1/2 让 (6.9). However Cs(V2z) 夭 0 in general, even 证 the 
method 2n has an order higher than two. 


JIX.7 Rigorous Estimates - Local Error 


Wherefore it is highly desirable that it be clearly and rigorously shown 
why series of this kind, which at first converge very Tapidly and then ever 
Imore Slowly, and at length diverge more and more, nevertheless give a 
Sum close to the true one 让 not too many terms are taken, and to what 
degree Such a sum can Safely be considered as exXact. 

(a footnote in Gauss” thesis, 1799) 


Up to now wehave considered the modified equation (1.1) as a formal series without 
taking care of convergence lssueS. Here， 
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we Show that already in very Simple situations the modified differential equation 
does not converge; 

e We give bounds on the coefficient functions 方 (y) of the modiftied equation (1.1)， 
So that an optimal truncation index can be determined; 

we estimate the difference between the numerical solution Wi = 环 ,(yo) and the 
exact solution V(h) of the truncated modified equation， 


These estimates will be the basls for rigorous statements concerning the long-time 
behaviour of numerical solutions. The rigorous estimates of the present Section have 
been given in the articles Benettin 儿 Giorgilli (1994), Hairer 儿 Lubich (1997) and 
Reich (1999). We mainly follow the approach of Benettin 儿 Giorgillj, but we also 
Use ideas of the other two papers. 


Example 7.1， We consider the differential equationl 7 = / 太 , V(0) = 0, and we 
apply the trapezoidal rule Wi = 尹 (jj0) 二 人)/2. In this case, the numerical 
solution has an expansion 到 (人 访 2) = 十 帮 (J 蚊 十 FE 十 办)A2 = 和 十 妃 AD 十 
12 户 人 的 /2 十 友 记 扫 /4 十 .sothat the modified equation is necessarily of the 
form 

攻 三 外 有 十 Ri 办 十 大 轨 记 乓 十 71203707 十 (7.1) 


The real coefficients can be computed by putting j 厂 = et The relation 
瑟 ( 芒 0 一 vt 十 (with initial value y(b) = yields after division by ef 


人 人 一 (二 六 十 术 且 二 区 展 二 (的 一 巧 ， 


This proves that 1 = 0, and 太一 刀 V/KEL where Bh are the Bernoulli numbers (See 
for example Hairer 儿 Wanner (1997), Sect.I.10). Since these numbers behave like 
盏 /Bl SConst (27 for8 一 co, the series (7.1) diverges for all 刀 夭 0, as 
SOon as the derivatives of 让 ( 划 grow like Fo (四 s 有 RE. This is typically the 
case for analytic functions jz with finite poles. 

Itis interesting to remark that the relation 环 ( 太 V) 一 Vt 十 尹 ) inothing other 
than the Euler-MacLaurin Summation formula. 

As aparticular example we choose the function 


5 


1 


Figure 7.1 shows the numerical solution and the exact solution of the modified equa- 
tion truncated at different values of N. For 中 = 0.2, there is an excellent agreement 
for N 所 12, whereas oscillations begin to appear from N = 14 onwards. For the 
halved step Size 由 三 0.1, the oscillations become visible for N twice as large. 


1 Observe that after adding the equation 达 = 1, t(0) = 0, we getfor 和 = 仿 力 7 the 
autonomous differential equation Y = 下 (六 ) with 严 (7) = (1 区 ). Hence, all results 
of this chapter are applicable. 
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Fig. 7.1. Numerical solution with the trapezoidal rule compared to the solution of the trun- 
cated modified equation for 尺 = 0.2 (upper four pictures), and for 岂 = 0.1 (Lower two 
Pictures) 


The main ingredient of a rigorous backward error analysis is an analyticity as- 
Sumption on the differential equation y = jy) and on the method. Throughout this 
Section we assume that F(V) is analytic in a complex neighbourhood of yo and that 


|ASWM for 人 一 ols2R (7.2) 


ie for al y of Bop(yo) := ts Caily=- ol < 2R}. Our strategy is the 
following: using (7.2) and Cauchy's estimates we derive bounds for the coefficient 
functions dj(y) of (1.3) on BR(yo) (Sect.IX.7.1), then we estimate the functions 
方 (y) of the modified differential equation on 妃 R/z(yo) (Sect.IX.7.2), and finally 
we Search for a Suitable truncation for the formal series (1.1) and we prove the 
closeness of the numerical solution to the exact solution of the truncated modified 
equation (Sect. IX.7.3). 


IX.7.1 Estimation of the Derivatives of the Numerical Solution 


If we apply a numerical method to y = jy) with analytic jy)，the expression 
Gj(Vy) will usually be analytic in a neighbourhood of 尺 = 0 andy E BR(Vo). 
Consequently, the coefficients qdj(y) of the Taylor series expansion 


@1(y) = 十 PC) 二 jado(y) 十 jds(y) 十 (7.3) 
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are also analytic and the functions qdj(V) can be estimated by the use of Cauchy”s 
inequajities. Let us demonstrate this for Runge-Kutta methods. 


Theorem 7.2，For da Rzzzge-Kztta 17etpod (11.7. 因 1et 


/= 》 | 虽 |， 一 ,Iax ol (7.4) 
i 一 1 


区 


矿 F) maaalytic 碎 加 e comiplex pa1l BaoR(Vyo) aad satisjies (7.2) 1Pen te coe 扩 - 
cient Jimactions dj(y) of(7.3) a1e analyficm BR(yo) amd satis 记 


2pMAN71 
尽 





Ilsox 人 甸 必 -< 有 03 
Pioojf Fory E Bspjz(yo) and |Ayl < 1 the function a(z) = jy 十 zAy) is 
analytic for |z| < 刃 /2 and bounded by M. Cauchy's estimate therefore yields 


上 Gayl = le (OIL S 2/ 忆 . 


Consequently, | jg E 2AMV/Rfory e Basny/z(yo) in the operator norm. 

Fory E BR(yoj, the Runge-Kutta method (IIL.1.4) requires the solution of the 
nonlinear System 90; 一 y 十 几 > 7-1 aijj 帮 9)，which can be Solved by fixed point 
iteration. 开 | 由 26AM/ 及 和 7Y < 1 it represents a contraction on the closed set 
{( 9 gs); lo 一 2 么 忆 /2} and possesses a unique solution. Consequently， 
the method is analytic for |P < ?7R/(2nAM) andy Ee BR(yo). This implies that the 
functions dj(y) of (7.3) are also analytic. Furthermore, |2z(y) 一 引 么 | 由 AAA for 
VE BR(Vyo) so that, again by Cauchy's estimate， 











忆 的 1= 天 | 六 (人 鸭 - 中 sex 人 


fory7 > 1. The statement is then obtained by considering the limit 7 一 工 . 














Due to the consistency condition 亚 二 | ui 三 1, methods with positive weights 
wo all satisfy / = 1. The values /,A of Some classes of Runge-Kutta methods are 
given in Table 7.1 (those for the Gauss methods and for the Lobatto IIA methods 
have been checked for s 和 9 and s 冬 5, respectively). 

Estimates of the type (7.3), possibly with a different interpretation of AM and 尺 ， 
hold for all one-step methods which are analytic in 尺 and y, e.g., partitioned Runge 一 
Kutta methods, Splitting and composition methods, projection methods, Lie group 
methods, ...， . 


Table 7.1. The constants 4 and A of formula (7.4) 








Imethod 几 乒 Imethod 内 | 抑 
explicit Euler 工 0 implicit Euler 1 | 工 
implicit midpoint | 1 | 1/2 | trapezoidalrule | 1 | 工 
Gauss methods 1 0 Lobatto IIA 1 1 
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IX.7.2 Estimation of the Coefficients of the Modified Equation 


At the beginning of this chapter we gave an explicit formula for the first coef6cient 
functions of the modified differential equation (See (1.4)). Using the Lie derivative 


(Dig)() = 9 (g 态 ( 切 (7.6) 
(cf. (VI.S.2)) and 万 (V) := ,these formulas can be written as 


户 殷 = 由 人 殷 一 于 (Di 全 
户 罗 = 中 切 一 机 (有 万 )G) 一 下 (Dz 户 十 亡 户 )() 


We have the following recurrence relation for the general case. 


Lemma 7.3. 矿 1Pe WU11e7ical 1Ietjpod jas a1 expa118iomn of 1Pe ja (7.3) 1Pen 1 太 e 
Jipctions 方 (7) oftpe maodijied dijerential eqguation (7.7) Sa1 记 


万) 二 0 >》， (Pu wa 


12 ”1 十 .… 十 太一 了 
Were km 过 1Jorall7m. Opserve 1pattperigjt-pand expressio1 oly ivolves 大 ( 
Wi KE < 7. 


Poof The solution of the modified equation (1.1) with initial value y( 信 = V can 
be formally written as (cf. (1.2)) 


区 十 门 ) = + 并 六 -D IF 


和 寺 


Where 严 (7) 三 用 人 ) 十 ja 0 十 .. .stands forthe modiftied equation, and 
jhD = /Di 十 12D。 十 jDa 十 .forthe corresponding Lie operator. We expand 
the formal sums and obtain 


生计 全 y+ 交 人 


2 人 工 和 





where all Km > 1. Comparing like powers of Rin (7.3) and (7.7) yields the desired 
Tecurrence relations for the functions 方 (y). 














To get bounds for || 方 (y), we have to estimate repeatedly expressions like 
(Diog)(y)ll. The following variant of Cauchy's estimate will be extremely useful. 


Lemma 7.4. Fopr analytic jinctions 态 (0) aad 9g(V) we paveJor0<ao<pztjpe 
ES1I1T1G1e 


工 
lPigle < 7 hille lole 


:= max{llg() ye Bo(yo)} and | 


【ee 











Dig|le are dejinred si- 





Lary 
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Proof Forafxedy E 成 - the function Ca 一 09(LV 十 之 访 ls analytic for 
2 二 9\V y 


上 lss 一) wib 订 二 | .illc- Since or ) 一 9 有 ) 一 (Di9g)(y)， 
we get from Cauchy'”s estimate that 





IDig=la(Ol<sz sup la < 


.9 
E |z|<e 0 一 I 的 














This proves the statement. 


We are now able to estimate the coefficients 方 (y) of the modified differential 
equation. 


Theorem 7.S. Zer jy) pe azalytic i Bap(yo),， ertpe Tayior series coefpicients of 
态 e 11U11lerical 111etpod (7.3) pe da1a1ytic 太 BR(yo)， azd assu1ie 1Pat(7.2) a10 (7.9) 
G1e Sa1isjied. 7Tj1em we jayvejortipe coejjpcie11s of1pe 1iodijied di1eretial eqguatiom 


AI 
| 访 O 川 <n2nax(2 人 Jr lsllsR2， 0.3) 


Where7 一 2max(k AN/(2in2 一 1)). 
Proojf Wefizx an index, Say .1, and we estimate (in the notation of Lemma 7.4) 
| 方 | 一 D5 for 了 三 2 翅 


where 0 三 忆 /(207 一 巧 ).This will then lead to the desired estimate for | 万 ||ay>， 
In the following we abbreviate |.||P_(D6 by 六 Using repeatedly Cauchy's 
estimate of Lemma 7.4 we get for Fi 十 .十 应 三 ITthat 


| PPAa Dj， 7 








工 
苹 5 
芋 元 二 | 
上访 : As | jzllus 8 | 和 天 


| 入 








苦 二 


The last inequality follows from |l9l 和 |9l， for 7 和 7 which is an immediate 
consequence of 已 R_(D5(yo) C 万 R_(_1)5(yo). It therefore follows from Lem- 
ma 7.3 that 





1 
| < ez + 下， >》， 5 | ja as 


1 一 2 ”ji 十 .… 十 太一 了 


By induction on7 (1 和 7 < .J) weobtain that || 方 上 和 00j where Dj is defined by 


记 1 
6 学 (2 富 3 0 


i 一 2 ”a 十 .十 司 一 了 
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4 到 
G 1 一 4C | ez 一 1 一 2 一 世 L 
1/zv 1 一 2Imn2N 人 一 In2 








Fig. 7.2. Complex functions of the proof of Theorem 7.3 (7 一 9 三 了 


Observe that Dj is defined for all 7 > 1. We let 0(G) = 2 人 >1 Di6G7 be its generating 
function and we obtain (by multiplying (7.9) with 67 and summing over 了 7 > J) 


9 1 [和 
二 二 








上 +ez9) 一 1 一 0O)， (7.10) 


Where we have used the abbreviations 7 := HUA/6 and dg := 261/ 尺 . 

Whenever et) 和 2 (eforC 天 (20 一 DJ/(7 二 4(20 一 D)) withb = In2 十 257z) 
the implicit function theorem can be applied to (7.10). This implies that 8(C) is 
analytic in adisc with radius 1/y = (2in2 一 1)/(7 二 dg(2In2 一 1)) and centre atthe 
origin. On the disc | 上 | 和 1/v, the solution %(6) of (7.10) with (0) = 0 is bounded 
by In2. This is seen as follows (Fig.7.2): with the function w = -YYC/L 一 00) 
the disc |C| 和 1/z is mapped into a disc which, for all possible choices of y > 0 
andd > 0,liesin lw 和 2m2-1.Theimage of this disc under the mapping 
p(w) definedby ez 一 1 一 2 一 wandb(0) =0is completely contained in the disc 
| 中 乏 了 mn2. Cauchy's inequalities therefore imply 10 乏 了 2 .zand we get 


ra lorlys567<m2.5. 


Sincev = 4+?Y/(2imn2--1)<7TAM.V/Rwithy givenby7 一 2max(k,N/(2in2 一 
1)) and ov <TM,this proves the statement for .7. 














IX.7.3 Choice of A and the Estimation of the Local Error 


To get Tigorous estimates, we truncate the modified differential equation (1.1), and 
We consider 


Y= Fwv( 人， Fwv( 们 =) 上 AP 人 上 + + 人 0.1D 


with initial value V(0) = yo. It is common in 
the theory of asymptotic expanslions to truncate 
the series at the index where the corresponding 
term is minimal. Motivated by the bound (7.8) 
and by the fact that (szZ)z admits a minimum 
for z = (se)-1 (see the picture to the left with 
E 王 0.15), we Suppose that the truncation index 
A satisfies 
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加 
ea 





PNV<h with 加 (7.12) 


Under the less restrictive assumption PN < eho, the estimates (7.2) and (7.8) imply 
for ly 一 zo|| < 有 /2 that 


Imwols w 人 1 rm22 (2 ) 


了 
xw(1+om2> 人 ( 坟 ) 】 < M(1+1659) 


One can check that the sum in the lower formula of (7.13) is maximal for N 三 了 
and bounded by 2.38. For a pth order method we obtain under the Same asSumptions 


(7.13) 





| 入 


EN 一 大 OO S ce， 0.14) 
where c depends only on the method. 


Theorem 7.6. ZLel j 帮 V) pe analytic 碎 BoR(yo)， 1et 1pe coefjicients di) or ipe 
1aetjpod (7.3) pe a1alytic 太刀 R(V0)，a1d assu1ae 妨 af (7.2) ad (7.3) pold. 不下 瓜 
1o/4w 识 7jo 三 民 ][(eM), ipen tpere exisk N = VOD (namely N equal 1o 1Pe 
La18est integer Satishi1g RN < Ro) sucp 1Pat tpe dierence pethween tpe 12U11erical 
SOUU1ioP UV1 三 @p(V0) aad 1pe exact solution ONi(yo) of ipe 1ru1acated miodified 
eqgUa1IO1 (7.77) Satisjies 


| 到 (yo) 一 Za 人 go) < maMe oo 人 


Where7y 三 e(2 十 1.657 十] depends ojiy on tpe metpod (we jnave5 近 7 挟 5.18 
a1d 了 区 31.4Jortpe 71aetpods of72pie 7 7) 


The quotient 二 = MI/Ris an upper bound of the first derivative 户 (y) and can 
be interpreted as a Lipschitz constant for /yy). The condition 灵 < Po/4is therefore 
equivalent to AD < Comst, where Comst depends only on the method. Because of 
this condition, Theorem 7.0 requires unreasonably Small step Sizes for the numerical 
solution of stiff differential equations. 


Proofof7jpeore1z 7.0. We follow here the elegant proof of Benettin 狠 Giorgilli 
(1994). Itis based on the fact that Gj(yo) (as a convergent series (7.3)) and ON hn(Vo) 
(as the solution of an analytic differential equation) are both analytic functions of 九 . 
Hence， 

9 :一 (yo) 一 No) (7.15) 


is analytic in a complex nelighbourhood of 岂 = 0. By definition of the functions 
方 (y) ofthe modified equation (1.1), the coefficients of the Taylor series for @n(Vo) 
and DNn(yo) are the same up to the Ph term, but not further due to the truncation 
ofthe modified equation. Consequently, the function 9( 户 ) contains the factor PN+1， 
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and the maximum principle for analytic functions, applied to g(P)/PN+l, implies 
that 
AN+1 
le(al < 目 maxlgC| for 0<h<e 09 
计 9g(z) is analytic for |z| 和 s. We Shall show that we can take <s = eho/N, and we 
compute an upper bound for ||9(z)|| by estimating separately | 2z(yo) 一 vol and 
12wn(yo) 一 yol. 

The function 瑟 :(yo) is given by the series (7.3) which, due to the bounds of 
Theorem 7.2, converges certainly for |z| 和 已/(4kAM ),and therefore also for|z| < < 
(because 24 和 71 and N > 4, which is a consequence of Po/ 岂 > 和 . Hence, it is 
analytic in |z| 冬 s. Moreover we have from Theorem 7.2 that ||@:(yo) 一 yo|| 么 
zlAMGL+A) forlzl < s. 

Because of the bound (7.13) on Fw(y)，which is valid for y E BRpy/z(yo) and 
for | 和 s, we have | 2wz(yo) 一 yo |z|AM(L+1.657) as long as the solution 
ON,z(yo) stays in the ball BRy/z(yo). Because of sM(I 二 1.657) < 民 /2, which is a 
consequence of the definition ofs,of V 之 4 andof(1 十 1.657) 和 1.857 (because 
for consistent methods / 之 1 holds and therefore also7 之 2/(2in2 一 1) 之 5),this 
is the case for all |z| 和 s. mn particular, the solution ON, z(yo) is analytic in |z| < <. 

Inserting s = eho/V andthebound on |9(z)| 乏 18:(yo) 一 xl 二 2wz(yo) 一 
zoll into (7.16) yields (with C = 2 十 1.657 十 人 


NT+L V V 
oo) < ec 苹 Mrc( 站 一 Mrc( 全) < ]AMCe-n， 
CE/10 


because PNV < ho. The statement now follows from the fact that N < ho/ 及 < 
V 二 1,sSothat 全 太 近 e. epo/. 














Adifferent approach to arigorous backward error analysis ls developed by Moan 
(2005). There, the modified differential equation contains an exponentially Small 
time-dependent perturpbation, but its How reproduces the numerical solution without 
erTOoT. 


IX.8 Long-Time Energy Conservation 


In particular, one easily explains in this way why symplectic algorithms 
give rise to a good energy conservation, with essentially no accumulation 
of errors in time. (G. Benettin & A. Giorgilli 1994) 


Asatfirst application of Theorem 7.6 we study the long-time energy conservation of 
symplectic numerical schemes applied to Hamiltonian systems 7 = .一 IV 万 ( 力 . 
follows from Theorem 3.1 that the corresponding modified differential equation is 
also Hamiltonian. After truncation we thus get amodified Hamiltonian 


瑟 (y) = 克 (y) 十 机 (人 十 .十 jw()， (8.1) 
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which we assume to be defined on the Same open set as the original Hamiltonian 万; 
See Theorem 3.2 and Sect. IX.4. We also assume that the numerical method Satisfies 
the analyticity bounds (7.3)，So that Theorem 7.6 can be applied. The following 
result is given by Benettin 儿 Giorgilli (1994). 


Theorem 8.1，Co7sider a 五 azzltomia1m syste1a with aaa1ytic :万 一 下 (wjpere 
万 C 形 2d)， and apppy a symzplectic Paerical maetjpod @j(V) wii step size 天 还 
加 e 111IeFicaL SOUU1iOP StayS 0 加 e co11Dpacft set 开 C 1jez 1ere exisf Po Ga11d 
N=N(as 太 7T1jeorea 7.6) sucP tat 


刀 (nm) = 巨 (0o) 二 O(e -120) 
瑟 (n) = 五 Co) 二 OU) 





oyer exDoe11ially 1o1g 1iyie intervals 7 万 < ezo/21. 


Pioof We let Ni(yo) be the fow of the truncated modified equation. Since this 
differential equation is Hamiltonian with 妃 of (8.1), 万 (wz(yo)) = 瑟 (yo) holds 
for all times t. From Theorem 7.6 we know that | yn +1 一 和 wp 人 (yn 苹 hmTAMe-zo/ 
and, by using a global A-independent Lipschitz constant for 万 (which exists by 


Theorem 7.5)，we also get 万 (n+ -万 (Bwn(n)) = O(Phe-zo/ 加 .From the 
identity 


训 g) - 攻 oo) = 并 (Bon) -ED) = 并 (io) 一 奋 (pvatw 


和 1 和 1 


we thus get 瓦 (加 ) 一 瑟 (yo) = O(nhe-m/), and the statement on the long-time 
conservation of 万 is an immediate consequence. The statement for the Hamiltonian 
瑟 follows from (8.1), because 吾 H1(y) 十 Ba() 十 .十 AP LIBN() is 
uniformly bounded on K independently of 几 and N. This follows from the proof of 
Lemma VI.2.7 and from the estimates of Theorem 7.3. 














了 xample 8.2， Let us check explicitly the assumptions of Theorem 8.1 for the pen- 
dulum problem d = D, 妃 = 一 sin q. The vector field jp,o) = (p, 一 sin gd)7 is also 
well-defined for complex p and dg, and it is analytic everywhere on C2. We let 开 be 
a compact subset of {(p,q) E 展 2? ; |p| < c}. As aconsequence of | sin dg| elImdl， 


we get the bound 
| E Vc 二 4B2 十 e22 


for ||(p,a) - (poao) 中 和 2Rand (po,qo) E 天. 开 wechoosec 芝 2 及 = 1 
and M = 4, the value ho of Theorem 7.6 is given by Po = 1/4em 盖 0.018 for 
the methods of Table 7.1. For step sizes that are Smaller than 7jo/20, Theorem 8.1 
guarantees that the numerical Hamiltonian is well conserved on intervals [0,7] with 
了 el 之 2.104. 

The numerical experiment of Fig.8.1 shows that the estimates for Po are of- 
ten too pessimistic. We have drawn 200 000 steps of the numerical Solution of the 
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Fig. 8.1. Numerical solutions of the implicit midpoint rule with large step Sizes 


implicit midpoint rule for various step Sizes 几 and for initial values (po,qo) = 
(0， 一 1.5)， (po, 9o) 一 (0， 一 2.5)， (2o, go) 站 (1.5， 一 和 T)， and (po, do) 一 (2.5， 县 站 
They are compared to the contour lines of the truncated modified Hamiltonian 

D2 六 2 

万 (D;,g) 三 二 一 cosd 十 一 (cos(2g) 一 2p2 cos 路 . 

2 48 

This shows that for step Sizes as large as 妨 冬 0.7 the Hamiltonian 万 is extremely 
well conserved. Beyond this value, the dynamics of the numerical method soon 


turns into chaotic behaviour (see also Yoshlida (1993) and Hairer Ngrsett & Wanner 
(1993), page 336). 


Theorem 8.1 explains the near conservation of the Hamiltonian with the Sym- 
plectic Euler method, the implicit midpoint rule and the Stormer-Verlet method as 
observed in the numerical experiments of Chap.I in Fig.I.1.4 for the pendulum 
problem, in Fig.I.2.3 for the Kepler problem, and in Fig.I.4.1 for the frozen argon 
CTyStal. 

The linear drift of the numerical Hamiltonian for non-Ssymplectic methods can 
be explained by a computation Similar to that of the proof of Theorem 8.1. From a 
Lipschitz condition of the Hamiltonian and from the standard local error estimate， 
we obtain 瓦 (和 ri) 一 吾 (ph(g)) = OP+D .Since 吾 (oh( 加 )) = 吾 (oo,a 
Summation of these terms leads to 


克 (yn) 一 瓦 (yo) = O(tPP) for 寺 三 7 几 . (8.2) 


This explains the linear growth in the error of the Hamiltonian observed in Fig.I.2.3 
and in Fig.I.4.1 for the explicit Euler method. 
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IX.9 Modified 上 quation in Terms of Trees 


By Theorem III.1.4 the numerical solution Vi = 2n(yo) of aRunge-Kutta method 
can be written as a B-Series 


于 ( 切 一 乡 十 RAO) 二 肥 oa) 


(9.1) 
未 (ia(VWPp OHaDPPyog) 和 





For consistent methods, i.e., methods of order at least 1, we always have w(。) 一 |， 
So that the coefficient of 凡 _is equal to jy). In this section we exploit this special 
Structure of Bj(V) in order to get practical formulas for the coefficient functions of 
the modiftied differential equation. Using (9.1) instead of (1.3), the equations (1.4) 
yield 


PPg = (oO)- 引 (WO 
Ag = iaV)-a07)+ 让 PP] 9) 


下 (ce 一 a( 7 ) 十 了 ) FF 人) 


Continuing this computation, one is quickly convinced of the general formula 


方 (= 》 一 一 Fr)(y)， (9.3) 


|7|=7 


So that the modified equation (1.1) becomes 
二 帮 I 一 ! 
= 》 一 一 -2(7) Fr)( 纺 ) (9.4) 


with b(。) 一 1 207) =a(7) 一 二 etc.Since the coefficients xc(7) are known from 
Deftnition II.1.7, all we have to do ls to fnd sulitable recursion formulas for the real 
coefficients b(7). 


IX.9.1 B-Series of the Modified Equation 


Recurrence formulas for the coefficients (7) ip (9.4) were first given by Hairer 
(1994) and by Calvo, Murua 多 Sanz-Serna (1994). We follow here the approach 
of Hairer (1999),，which uses the Lie-derivative of B-series and thus Simplifies the 
construction of the coef6cients. 

We make use of the notion of ordered trees introduced in Sect.IU.1.3. For a 
given tree7T we define the set of all sPZzttz7z2gsy as 


SP(7) 一 {9 E O4S7(7T) ; T\0 consists of only one element } (9.3) 


Here, OS7(7) = 08ST(w(r)) is the set of ordered subtrees as defined in (IIL.1.33). 
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Lemma 9.1 (Lie-Derivative of B-series)，Zef b(7) (wiiU 0 册 = 0) and c(7) pe 
1jpe coejpicients of po B-seriey ad 1et yt pe da jpmaal solution or ipe deren- 
fial eguation 18 的 = 瑟 (by(b). 7Tje Lie derivatiye oftje jnction B(cy) wii 
1espect io 1pe vector jield 已 (bo Ma8ain dB-series 


用 本 B(cy) = 互 (Boc yb). 


11s coeHpcienty ae give py 05c( 人 =0 andjJorlr 二 1D7 


Bclr)= > ，c(0)2rN\g). (9.60) 


bcSP(7) 





Fig. 9.1. Splitting of an ordered tree w into a subtree 0 and {0} = w\0 


Prooj For the proof of this lemma it is convenient to Work with ordered trees w E 
O7. Since (7) of (HI.1.31) denotes the number of possible orderings of a tree 
TET,asum > cr:/ becomes > ucorz(w) 开 :人 

For the computation of the Lie derivative of 已 (c,y) we have to differentiate 
the elementary differential 严 (0)(y( 坟 ) with respect to 上 Using Leibniz rule, this 
yields |b| terms, one for every vertex of 0. Then we insert the series (b,y( 坟 ) for 
1 的. This means that all the trees 5 appearing in 已 (bg(b) are attached with a 
new branch to the distinguished vertex. Written out as formulas, this gives 


d /lol c(0) 51 500) 
玫 一 已 (cy 人 坟 ) 三 一 一 一 一 一 一 全 司 (Oov0)VLt 
人 ( ,2 )) 人 z(O0) al(0) 人 2 Z(0) al(0) 人 4 )(%( ))， 
Where >， y ia sum over all vertices of 0, and 0 ov 0 is the ordered tree obtained 
When attaching the root of b with a new branch to 7 (See Fig. 9.1). We choose one of 
the (7) 十 1 possibijlities of doing this, where mn(7) denotes the number of upwards 
leaving branches of O at the vertex 7y. We now collect the terms with equal ordered 
treew = 0ov0,andnotice thatv(0)c(0) = A(O0) with K(O0) given by (IIL.1.32). This 
gives 


d 号 c(0) 00) 
hB(osb) = > Al 人 ( 和 全 5)PC)(GG) 


wEOT 0o~v0 一 ww 





Where 泡 6 is over all triplets (0,7y, 0) such that 0o; 0 = w. Because of <c(w) 王 
A(O)A(0O)(2(7) 十 1), we obtain 
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全 全 


Fig. 9.2. Ilustration of the formula (9.6) for an ordered tree with 5 vertices 





玉 |w| 





15B(cyO) = 并 (<OUD)FC(GG) 


wEOT7 Atw) go 0 一 w 


| 
- 工 生 ( 于 Our\O)PmGo) 


TE OcESP(T) 











Which proves the statement. 





Let us illustrate this proof and the formula (9.6) with an ordered tree hav- 
ing 5 vertices. All possible splittings w = 0 ov ) are given in Fig.9.2. Notice 
that 0 may be the empty tree 0, and that always |0| > 1. We see that the tree 
w js obtained in several ways: 人) differentiation of 天 (O)(y) = y and adding 
有 (wj(y) as argument (ii differentiation of the factor corresponding to the root 
in F(g)() = 大 ( 访 访 (y) and adding EC) = (大 六 (四 Gib differentiation 
of all sin FUO)() = jw( 户 户 月 () and adding FF(。)(O) = Fand finaly 
(iv) differentiation ofthe factor fortherootin 严 (0)(V) = 大 ( 瑚 访 力 (Wandadding 
有 (。)(V) = .This proves that 


act) 二 <O)M 人 7/) +Tc( VOD)TceCY)R)T 2c( 员 小 


For the trees up to order 3 the formulas for Opc are: 





Doc() 二 <c( 包 以) 

Docly) = c 信 MA)+Tc)D) 

Doc(V) = <cV)+2c07)D) 

0 一 的 MD 二 区 EDJeDJE) 


The above lemma permits us to get recursion formulas for the coefficients p(7) of 
the modified differential equation (9.4). 


Theorem 9.2. 矿 ije zetpod By) Sivenpy(9.1)1pejinctions 方 (7) ofipe maod- 
z 太 ed dzerential equation (1.7 Satig 太 (9.3 wjpere tpe real coejpicients DT) da1e 7e- 
cl1sively dejized py pb( 仿 一 0,5(。) 王 1azd 


bi7)=alr) 一》， 二 D7 0(7). (9.7) 


Dere， 有 75 81je (7 一])- 太 ierate oftpe Lie-derivative Do dejijzed 记 Le1a1zaa 9.7. 
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Proof The right-hand side of the modified equation (9.4) is the B-series 已 (b, 7) 
divided by 几 . It therefore follows from an iterative application of Lemma 9.1 that 


太 OO = (85 这 昌 )， 


So that by Taylor series expansion V(t 十 刀 ) 一 十 了 (>)>1 页 D7 1 0) ,where 
4 := V 蚊 .Since we have to determine the coefficients b(7) in Such a way 
that 7 十 妃 三 @(V) 三 刀 (ay)，a comparison of the two B-series gives 
22hi>1 页 0 br) = a(r). This proves the statement, because 90b(r) 一 b(7) 
forr ecT,and80 :bo(r) 一 0fory > |r| (as aconsequence of b(0) = 0). 














We present in Table 9.1 the formula (9.7) for trees up to order 3. 


Table 9.1.， Examples of formula (9.7) 





r=。 | 5 =a() 


兰 大 已 


= 
2 


厂 
厂 
厂 








We next consider the case when asSymplectic method is applied to a Hamiltonian 
system yy = .JIV 万 (四 .Itfollows from Theorem 3.1 that the modified equation is 
again Hamiltonian. What does this imply for the coefficients of (9.4)7 


Theorem 9.3. SuPpPpose 1pat Jor al Bammailtonians 万 (y) te modified vector jie1d 
(29. 和 11U1acated afier az arpitary pomwer or ifiocaly) Pamaziliomiaz. 7TPe1， 


boo) 二 buo)=0 Joral 山 vET (9.8) 


Pioof Let ONi(yo) be the flow of the modified differential equation (9.4),， trun- 
cated after the ji-1 terms. It is symplectic for all 上 and in particular for 上 一 用 . 
As a consequence of the proof of Theorem 9.2 we obtain that ONn(yo) is a Sym- 
plectic B-series 妃 (aw,yo). The coefficients wow(7) are given by (9.7), where bp(7) 
is replaced with 0 for |r| > N. ForwveET with lu 十 lv = V wetherefore have 


buou)= 三 av(uou) 一 aNw_i(uo)， 


Since wwv(7T) = QN_i(7T) for |7| < _ V, formula (9.8) is an immediate consequence 
of Theorem VI.7.0. 














Remark 9.4. Let G = {a :了 一 了 |a( 人 = 1 bethe Butcher group (see 
Sect. IIU.1.3), and consider the mapping 9 : CG 一 下 deftinedby 





S(o) = atuou+aloo 由 al:a(o 
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If we denotebye E G the element corresponding to the identity (ie., e( 信 = 1 and 
e(T) = 0 for |r| > 1, we have for its derivative 


3'(ejp = puov) 二 boot) 


Hence, coefficient mappings bg(7) satisfying (9.8) lie in the tangent space at e(T) of 
the Symplectic subgroup of CG (ie., a E G satisfying (VI.7.4)). This is in complete 
analogy to the fact that Hamiltonian vector fields can be considered as elements of 
the tangent Space at the identity of the group of Symplectic diffeomorphisms (See 
also Exerclises 15 and 10). 


JIX.9.2 了 lementary Hamiltonians 


于 the modified differential equation (9.4) is Hamiltonian, can we find explicit for- 
mulas for 歹 ( 妇 2? Let us start with an easy example, the implicit midpoint rule. Writ- 
ten as a B-series (9.1), its coeffGicients are a(7) 三 21-7| (cf. Exercise 8) so that the 
first coefficient functions (9.2) of the modified equation satisfy 户 (y) = 0 and 


户 切 = 页 (200711 罗 -天 (OP]O)， 99) 
Since /FUy) = .JIV 万 (W), differentiation of 
丽人 = -去 尼 切 (VEO VEO) O.0) 


shows that 户 (y) = . 王 1VDas(y, and we have found an explicit expression of the 
Hamiltonian corresponding to the vector field 户 (四 .Itis recommended to compute 
also 广 (y) and to try to find 丈 5(y) such that 方 (V) = JIVF5( 人 .Such com- 
putations lead to expressions that have been introduced in a different context by 
Sanz-9erna 贸 Abia (1991). They call them cazzo7icalL ele1ze1ta1y Ci1herem1ialy. 


Definition 9.$ (Elementary Hamiltonians). For a given Smooth function 囊 : 
万 一 了 (with open 万 C 月 29) and forr E 人 wedefine the elementary 瑟 azmi 
io1lia1 万 (7T) :万 一 及 by 


刀 (,)(O = 万 ( 轨 ， 互 (r)( 切 = 万 69( 人 (Fn 人 Fn)( 仿 ) (9.1) 


for7 = [ 九 )….,Tm .Here, 天 ( 关 )(V) are elementary differentials corresponding to 
j0) = Y 末 (人 


The expression in (9.10) is nothing else than the elementary Hamiltonian corre- 
sponding to the tree W. Our aim is to prove that, for symplectic methods applied 
to Hamiltonian Systems, the coefficient functions (9.3) of the modified differential 
equation satisfy 户 (y) = JIVEi(, where 五 )(y) is alinear combination of ele- 
mentary Hamiltonians. 
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Lemma 9.6. 天 /ee11a7y G111z1tomia1y Sa1isj 
万 (oo)( 作 十 瑟 ( 人 ou 人 =0 Jorall 如 ET (9.12) 
态 Pa1ticulaz We jpave 万 (uot( 人 三 0 Jorallv ET 


Pioof This follows immediately from the fact that for = | 由,um E 工 
and for w E 人 we have 瓦 Loo) = 五 (HE Eu) Fo)) = 
形 (TV 万) (OPC Fw) = 开 (o)77RP(O, and from the skew-sym- 
metry of ./. 














The trees ovwandwvwowhavethe Same graph and differ only in the position 
of the root. The relation (9.12) thus motivates the consideration of the (Smallesb) 
equivalence relation on 7 Satisfying 


ooUAN UVUOLL (9.13) 


We want to select from each equivalence class, not containing a tree of the form 
o?u, exactly one element. This can be done as follows (cf. Chartier Faou 儿 Murua 
2005): we choose a total ordering on the set 7 that respects the number of vertices， 
ie.,u < whenever |v| < lv|, and we define 





27 = { 人 Urlrcannotbe writtenas7T 王 UVovwwithv<v} (9.14) 


OO 


(for the second line we assume [。,。| < |[]]).Everytreer ETiseither equivalent 
to Some Wowortoatree in7”. This is aconsequence of the fact that as long as 
7 了 三 ovwwith <vu,itcan be changedtovow(what happens only afinite number 
of times). Moreover, two trees of 7” can never be equivalent. 


Lemma 9.7.。 For atree7TE7T”wWejpave 


J VE(r)() =clr)>， 


0O~T 


CO) 人 2， (9.15) 


Were Fr(T,O) 加 eaper ofroof cjpa1zges 1Pat ae 1iecessa1y 10 opiain 0 jjo1zz T， 


Proof We compnute .JIV 瓦 (7)( 四 .The expression 瓦 (7)(y) consists of || factors 
corresponding to the vertices of 7, each of wphich has to be differentiated by Leibniz” 
rule. Differentiation of 所 Cn)( 攻 (cf. Deftinition 9.3) and pre-multiplication by the 
matrix .7 yields F(7)( 四 ). Before differentiating the other factors, we bring the 
Corresponding vertex down to the root. In view of Lemma 9.0 this only multiplies 
瓦 (7)(y) by (一 1)“(tr9), and shows that a differentiation of the corresponding factor 
yields 已 (0)( 四 .Since r E T”, the number of possibijlities to obtain 0 from 7 by 
exchanging roots is equal to o(7)/c(0). This factor has to be included. 
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IX.9.3 Modified Hamiltonian 


We are now in the position to give an explicit formula for the Hamiltonian of the 
modified differential equation provided that the numerical method can be written as 
aB-series. An extension to partitioned methods will be given in Sect. IX.10. 


Theorem 9.8. Comsider a 7U1aerical 11ethod 1Patca1zz pe Write ayaB-seriey(9. 几 
a1d 1Pat is sy1aplectic jpor every Panilionian syste11 = JIV 万 (四 ). 1 modijied 
djere1tial equation 1 加 e1 瑟 a11atliomia1z Wi 


声 (J) = 丽 ( 妇 十 帮 瑟 ( 切 十 甩 丽 ( 仿 十 ， 


WApeye 


历 ( 轨 = >， 0 (9.16) 


(7 
TET*，,|7| = 了 7 


C1d 11e coe 记 cient (7T) da1e 妨 ose of 7Tjeorem 9.2. Notice 11at 加 e SU 加 (9.70) 8 
O11y over 11eegy 胡 7 ay deji1ed 友 (9.7 和) 


-一 


Proof We apply the method (9.1) to the Hamiltonian System，sSo that by Theo- 
rem 3.1 the modified differential equation is (locally) Hamiltonian. It therefore fol- 
lows from Theorem 9.3 that the coefficients bg(7) of (9.4) satisfy (9.8). This relation 
implies 5(0) = (一 DJ)*(r2)b(7) whenever 0 ~ 7T. Inserted into (9.3), an application 
of Lemma 9.7 proves the Statement. 














Remark 9.9. This theorem gives an explicit formula for the modified Hamiltonian 
(for methods expressed as B-series). Since the elementary Hamiltonians 妃 (7)(y) 
depend only on derivatives of 刺 (yj), this modified Hamiltonian is 81opal1y defined. 
For Runge-Kutta methods this provides an alternative approach to the statement of 
Theorem 3.2. 


Forthe Sake of completeness we give in the following theorem a characterization 
of Hamiltonian vector fields of the form (9.4). 


Theorem 9.10. 771e dijerential equation jg = 肆 ( 忆 人 wii = 075 瑟 amt 
io1ia1a Jprallyvectorjields Foy) = JIV 万 (trand om1y 太 


buoo) 十 bo)=0 Jora uvET (9.17) 


Proof The “only if part follows from Theorem 9.3. The “if” part is a consequence 
of the proof of Theorem 9.8. 














IX.9.4 First Integrals Close to the Hamiltonian 


We have Seen in Sect.IX.9.3 that for Symplectic methods the modified differential 
equation (9.4) based on jy) = .JLV( is Hamiltonian with a function of the 
form 
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|r| 一 1 
Fog)= 并 有 co) 全 .18) 
全 
and coefficients c(7) = g(7). In this section we study whether for non-symplectic 
methods a function of the form (9.18) can be a first integral of (9.4). This question 
has been addressed by Faou, Hairer 人 狠 Pham (2004), and we closely follow their 
presentation. 


Lemma 9.11. Zety( 访 pe asolulion oftipe derenfial eqguation (9.4) wjicjp ca pe 
WTitten as P0( 坟 三 ( 六 (人 ). VB tpen pave 


d 


元 如 (e 2() 和 刀 (joc， 2() 


mWjpere 6cb(。) 一 0 ad jpor7eTw 训 上 7 >1 


cm-= 工 CDrr0 2 > ， c(w)b\w) (2.19) 


0~T wET*mSP(O) 


Te jirstsu1a 23 Overall teey01patare equivalent io T (see (9.13 员 da1ad 1Pe seco11d 
SU1I 18 OVEF QUL SDHi8S OF0 as 太 Ze111110 9.1 (gee TDDie 9.2). 


Prooj The proof is nearly the Same as that of Lemma 9.1. The first sum in (9.19) 
appears, because 五 (0)(V) = (7)(y) for0 ~ 7 and because the sum in (9.18) is 
only over trees in 了 ~. 














Table 9.2. Formulas for boc(r) fortreesT7 ET”uptoorder6 


boc(V) = -2c(。)27) 
lc) = 3c(V)) -3c0)DV) 
bc = 4c)2) -4c0)DN) 


pctdy) 可 oOUJATeVMOD He 人 -ac 


met) = 2c(")by) -2c(V)2V) 
be) = 5cCVI) -5c0)CV) 


jc -sdooJHeeyoJHeda 


ct 人 当 cq 示 WO 二 全 MD 十 


RN ac -edGoxwJ-eNOMOD -VIMV) 
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Corollary 9.12. 77e junction 瑟 (c,y) of(9.78) 58 a Jst iptegral of tpe djerential 
edgUatio1 (9. 引 jporevery 瑟 ( 轨 入 aad omy 大 


Ooc(T)=0 JoraL7 ET (9.20) 


Prooj The sufftciency follows from Lemma 9.11 and the necessity is a consequence 
of the independence of the elementary Hamiltonians. TIo prove their independence 
wehave to show that the series (9.18) vanishes for all smooth 瑟 (y) onlyitc(7) =0 
for al 7 E 7 ”. With the techniques of the proof of Theorem VI.7.4 one can Show 
that for every tree7T E ”there exilsts a polynomial Hamiltonian Such that the first 
component of 严 (7)(0) vanishes for all trees except for 7. Differentiating (9.18) and 
employing Lemma 9.7 proves that c(7) = 0. 














Solving the System (9.20). We consider a consistent method, ie., !(。) = 1, and 
We Search for a first integral 万 (c,y) close to the Hamiltonian, i.e., c(。) = 1. 

|r| = 3: The condition (9.20) for7 = W implies (7 ) = 0, which means that 
the method has to be of order two. 

Ir| = 4: There is only one tree in 7T” with four vertices. The corresponding 
condition can be satisfied by putting c(\) = 2(V). 

|r| = 5: The third condition yields 5([[[]]]) = 0. Letting c(N7) be such that 
one of the other two conditions holds, we still have to Satisfy 


ME 人 -2 -0 (9.21) 


This condition is satisfied for symplectic methods, for which (wov) 十 b(uoz) = 0， 

and also for symmetric methods, for which (7) = 0 for trees with an even order. 
|r| = 6: There are four conditions for three c(7) coefficients. Assuming (9.20) 

for trees with less than five vertices, these four conditions admit a_ solution if and 


only 十 
5D(V) 二 + 54( 了) 肝 GMA 4 下 二 玉 
-lag)-3VJ(oCV)+a)) -0 


This relation ls obviously satisfied by every Symplectic method. However， as We 
Shall See Soon, there are Symmetric methods that do not Satisfy (9.22). 


(9.22) 


For various Symmetric methods of order 4 (ie., b(T) = 0forl < |r| < 5) we 
compute the coefficients (7) of the leading perturbation term in (9.4) and also the 
expression (9.22), See Table 9.3. None of the considered methods is Symplectic， 

Surprisingly, the 3-stage collocation method Lobatto JIA (see Table II.1.2 for 
the coeftcients) Satisfies the condition (9.22). This implies for every Hamiltonian 
System (reversible or not reversible) that the dominating error term in the numerical 
Hamiltonian does not have any drift. 

The 3-stage Lobatto IIB method (see Table II.1.4) does not satisfy the condition 
(9.22). We therefore expect a drift in the numerical 有 amiltonian. 


378 IX. Backward Error Analysis and Structure Preservation 


Table 9.3. Coefficients b(7) and expression (9.22) for methods of order 4 


method Y 人 心 了 Y (9.22) 
1 呈 1 1 1 1 
Lobatto IIA 120 240 150 120 240 750 360 0 


1 1 1 工 1 1 1 1 
120 360 720 120 360 720 240 48 











Lobatto IIJB 














Lemma 9.13. For give (7T),T ETsSsatishinsg pb 人 = 0 0(。) = and Jorjixed 
c(。), 1pe Linear syste1af(9.20)Jorc(T),TET” Pas ar11nosf o1e SO1U1io1. 


Pioof Weprovebyinduction on7r ET*that c(7) is uniquely determined by (9.20). 
For this we assume that the ordering on 7 1 such that，within trees of the Same 
order it ls increasing when the numer of vertices connected to the root decreasesS， 
cf. (9.14). 

Let7 三 [说 To EN\{ with| 刻 > 1 anddenote by 大 
the number of 。'S in this representation. Since the treeT o。is again in the Set 7”， 
condition (9.20) yields 


0 一 goc(ro。)=( 有 十 1)c7r)p。) 一 (E 十 1)c(。)b(7) 十. (9.23) 


For mm = 0, no further termas are present and c(7) is uniquely determined by this 
Telation. For 7 > 0, the three dots im (9.23) Tepresent a linear combination of 
c(HU)8(Z) with | 由 < |r| (which, by the induction hypothesis, are already known) 
and of c(C)p(。), where oa E 7T” is the representant in T”* of the equivalence class 
for 7 We use the notation 7' for some tree which is obtained from 7 by removing 
one of the end vertices of mm and by adding it to the root of 7. 

In general we will have7' Eee 7* (so that o =T), andin this case its number of 
end vertices connected to the root is larger than that for 7. Hence, a < 7T, and the 
coefficient c(c) is known by the induction hypothesis. 

I 开 7 和 7 what is only possible 这 = ov with lv = |vl and > v， we 
have 刀 一 巡 ouvandw <uwotice thatw 一 vwisnot permitted for trees in 了 *). 
In this case wehavea =vow ET .Consequently, c(T) = c(wov) is expressed 
in terms of c(vo 凡 ) and known quantities. Applying the same reasoning to wo 让 
and observing that because of > V the tree v has at least as many end vertices 
connected to the root as the tree ,we see that c(w o WwW ) is expressed in terms of 
already determined quantities. 














The expression (9.20) is bilinear in bg and c. Assuming that hy = 三 妃 (b,y) 
Hamiltonian, the mapping b has the Same degree of freedom as c. It is therefore not 
astonishing to have the following dual variant of Lemma 9.13. 


Lemma 9.14. Zet c(r),T ET* pegivena1nd assu1ie c(。) 一 1a1dp00) = 0.7Tjpem， 
Jr Jixed b(。)， 1pe 1ipear syste1a (9.20) Jor blT))T ET JPas alt7aost ole SOLUO1 
Satishyizg b(uWou) 十 bo) 三 0JorauveE 工 
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Pioof By assumption on D, the coefficients g(T),T ETN\T are uniquely deter- 
mined by those for 7T E 72”. The statement is thus obtained in the Same way as 
that for Lemma 9.13 with the only difference that expressions c(。)b(o) and not 
c(a)b(。) have to be studied. 














Theorem 9.1S (Chartier Faou & Murua 2005$). 7je oly 8y11Plectic 1metpod (as 
B-series) 1Pat co1lse1ves 妨 e 忆 aailionian Jor arpitrarmy 殉 (W) Nt1pe exactjJjow ortpe 
di1eretial eqguatio1， 


Proojf HI the method conserves exactly the Hamiltonian，we have (9.20) with 
c(。) = 1 and c(7) = 0for all other trees in 7”*. By the uniqueness statement 
of Lemma 9.14 and the Symplecticity of the method (Theorem 9.10)，we obtain 
b(7T) = 0 for |r| > 1. Consequently, no perturbation is permitted in the modified 
differential edquation of the method. 














A closely related result is given in Ge 多 Marsden (1988). There，general Sym- 
plectic methods are considered (not necessarily B-series methods) but a weaker re- 
Sult is obtained (in fact, they assume that the System does not have other conserved 
quantities than 巨 ( 四 ,and it is shown that the numerical fow coincides with the 
exact How up to areparametrization of tme). 


JIX.9.S Energy Conservation: 了 Examples and Counter- 上 Examples 


It is generally believed that symmetric methods applied to reversible Hamiltonian 
Systemas (Teversible in the sense that 万 (-D,dq) = 瓦 (p,q)) have the same long- 
time behaviour as Symplectic methods. This is true in many Situations of practical 
interest, and we Shall prove this Tigorously in Sect.XI.3 for integrable reversible 
Systems. There are, however interesting counter-examples to this general belief. 
They are taken from Faou, Hairer 儿 Pham (2004). 


了 xample 9.16. Our first example is a modification of the pendulum equation 


万 (p) = 了 2 一 cos9 十 5 sin(20)， (9.24) 
where the additional term sin(2d) destroys the symmetry in q. The Hamiltonian still 
Satisfies 刀 ( 一 p,9) = 瓦 (p,9q). We consider initial values p(0) = 2.5, 9q(0) = 0 with 
suffciently large initial velocity, such that p(t) stays positive for all times and the 
Symmetry D 过 一 D does not affect the numerical solution. The angle d(t increases 
without limit, but the potential is 2r-periodic so that the solution Stays on a closed 
curve of the cylinder 及 x 91. 

We apply the 3-stage Lobatto IIIA and IIB methods to this problem. Figure 9.3 
Shows the error in the Hamiltonian along the numerical solutions. There is a visible 
energy drift of size O(th4) for the Lobatto IIIB method and no drift can be seen on 
this Scale for the Lobatto IIJIA method. To get more inslght into its long-time behav- 
iour, we apply the method with the Same Step Size to a much longer time interval， 
and weplotthe errorin 刀 (pn qu) 十 1 万 5(pn gm where the first perturbation term 
is computed from (9.18) and the linear System (9.20) as 
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Fig. 9.3. Numerical Hamiltonian of Lobatto methods of order 4 for the perturbed pendulum 
(9.24); step size 尹 三 0.2, integration interval [0, 500] 
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Fig. 9.4. Error in 万 (p,9) 二 1F5(p, dg) along the numerical solution of the 3-stage Lo- 
batto IIIA method for the perturbed pendulum (9.24); step Size 玉 = 0.2, integration interval 
[0, 500 000] 


刺 @g= 商 (85000209 - (go+O(O)) 


with the potential V(dq) = 一 cosdg 十 0.2 sin(29) (See Fig. 9.4). Repeating the same 
experiment with halved step Size Shows that there are oscillations with amplitude 
O(P5) and a drift with slope O(15). Consequently, the error in the Hamiltonian for 
the Lobatto IIIA method behaves on this problem like O(jPL 十 态 3). 

Without the term sin(2d) in (9.24) all symmetric one-step methods nearly con- 
Serve the Hamiltonian. 


了 Example 9.17. For polynomial Hamiltonians (V) of degree at most four, the el- 
ementary Hamiltonian corresponding to the tree “NA” vanishes identically. There- 
fore, the condition (9.20) need not be considered for this tree, and the remaining 
three conditions can always be Satisfied by the three c(7) coefficients. This implies 
that, for example for the HEnon-Heiles problem 


1 1 1 
如 (pl,pa,gl,g@) = 了 (D1 十 22) 十 了 (0 十 叱 ) 十 glg2 一 子 @， (9.25) 


the leading errorterm in the numerical Hamiltonian remains bounded by all methods 
of order four. Numerical experiments indicate that in this case also higher order error 
termas are bounded by Symmetric methods such as Lobatto IIJIA and IIB, even 让 the 
initial values are chosen So that the solution ls chaotic. 


了 xample 9.18. A concrete mechanical System with two degrees of freedom is de- 
Scribed by the Hamiltonian 
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基 记 二 站 和 ”年 三 9.26 
饭 介 三 也 六 2 十 可 ( 人 和 一 蕊 十 全 (9.20) 


lg 一 al 
It is a model of a planar Spring pendulum with exterior forces. The Spring has a 
harmonic potential with frequency w (Hooke's law). The exterior forces are gravi- 
tation and attraction to a mass point Situated at w, which has to be chosen So that no 
Symmetry in the q-variables is present. 

The numerical experiments，reported by Faou，Hairer 儿 Pham (2004)，use 
w = 2,a = (-3,-5)7, and initial values for the position q(0) = (0,1)7 Cup- 
right position), and for the velocity p(0) = (一 1 -0.5)7. The pendulum thus turns 
around the fxed end of the Spring which ls at the origin. 

As for the problem of Example 9.16 one clearly observes a drift for the 3-Sstage 
Lobatto IIIB method, and the error in the Hamiltonian behaves like O(tjh4). As 
predicted by the theory of the preceding Section, the dominant error term for the 3- 
stage Lobatto IIJIA method is bounded. There ls, powever a drift already in the next 
term so that the error in the Hamiltonian behaves for this method as O(j 十 态 5). 

Removing one of the exterior forces (graviltation or attraction to aw), the error 
in the Hamiltonian remains bounded of size O(j4) without any drift (even not in 
higher order terms) for both Lobatto methods. 


IX.10 卫 xtension to Partitioned Systems 


Allresults of Sect. IX.9 can be extended to partitioned methods whose discrete flow 
can be written as a P-series. This includes Important geometric integrators Such as 
the Symplectic Euler method and the Stormer-VYerlet Scheme. Interestingly, many of 
the resujlts have been originally presented and proved for this more general case (See 
Hairer (1994)). 


IX.10.1 P-Series of the Modified Equation 


We consider the partitioned System 


六 三 太 D, 9)， d=9(D,9)， (10.1) 


where, in view of an application to Hamiltonian systems,，we use (D, 9) instead of 
(2 2z) for the variables. By Theorem II.2.4 all consistent partitioned Runge-Kutta 
methods can be written as P-series (cf. Deftinition II.2.1) 


( 鸭 S(OROJRROenaOGOe am 


Where the subscript 0 indicates an evaluation at the initial value (Zo, ao). The first 
perturbation term of the modified equation (1.1) can therefore be written as 
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AI 


and, in general, one finds 


)( 户 疙 DJ+(aCP) 一 坟 (Hg)(， 
)(opj 站 DO) +(a(8) 一 二 (gg)(p,9) 





局 | 请 忆 | 哺 





可 sw | (10.3) 
91(D,g) 2 re7TP |rl-j 和 Cr)(D, 9) 
Hence, the modified equation (1.1) is of the form 
地 太一 ! 
加 (2 OECD 有 
4 >_re7P， 与 (7T) (7 ) Pr )D， 9) 


where bg(7) = 1 for Ir| = 1 7) = al7) 一 二 for |r| = 2.For Ir| > 2, the 
coefficients bg(7) can be obtained recursively from Theorem 10.2 below. The proofs 
of the following two results are Straightforward extensions of those for Lemma 9.1 
and TIheorem 9.2, and are therefore omitted. 

Lemma 10.1 (Lie-Derivative of P-series). Let pg(7) (wii 0) =D 加 ) =O) and 
c(7) pe ipe coeHjicients of ppo P-seriey anad 1et (pg(b) pe aormaal so1ution of 
1je cjerential equation Pa) = 已 (2 (pq 的 )) ie, (70. 估 The Lie 
derivatimve of jpe jxnctiom P(c， (D， dg)) Wiip yespect io tipe vector jield 己 (b， (D， 9)) 好 
0G8a110 0 P-Series 


d 


和 PleG,ab) = Place ,al) 
11s coefjcienfs ae giren py Boc( 加 ) = Doc( 加 ) =0,andjorlrl 二 1D 
Bocr)= > ，c(g)2rN\b)， (10.5) 
OcSP(7) 











Were, analogotsiy 1o (9. 和 人 9P(T) denotes 1pe sef ofsplitti8y of7T E 7 
In formula (10.3), 加 E SP(7) defines a splitting only 让 7 E 7TP， and 
lo es SP(r) only 让 7 E 7TP. We therefore have Doc(。) = c(0)0(。), Doc(o) 三 
c(0o)p(oej, and as examples for trees of order 3 
ooc(A) co)bA)+2c(A)2o)， 
Die(V) = co)DV)+TeCI Jo)+cCA 
Theorem 10.2. 矿 ie mretpod (Di,q1) = 21(Do,dqo) ca pe written ay (70.2) 1pe 


1odijied djerential equation mg give py (70. 逢 wjpeye tpe real coefjicientsy bg(7T) ae 
1ecu1sively dejizzed py pg 加) = 0 ) =0 7) 三 1jorlrl 三 1 anad 





| 





加 
br)=alr) 一》， 0 br) Jor 7re7P (10.6) 


=2 














ere， D7 de1otes 1e iterate of1ijpe ZLie derivafive Oo dejined 态 Le1az1za 70.1. 
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了 xample 10.3. The symplectic Euler method 


DPn+l 三 Dn 十 几 FDnL， dn)， qdn+1 三 9n 十 Pg(Dn+1l， dn) (10.7) 


is a partitioned Runge-Kutta method (all = 一 1 011 = 一 0, 0 一 7 一 1) and can 
therefore be expressed as a P-series (10.2). From Theorem JI.2.4 we get its coef- 
clients: 

1 iall vertices (different from the roob are black， 

a(T) 二 

0 otherwise. 
From Theorem 10.2 we can compute the coefficients b(7) of the modified equation 
(10.4). They are given in Table 10.1 for the trees with a black root. Since w(T) does 


not depend on the colour of the root of r, the Same holds for the coefficients p(7). 
Hence, we do not include the values of b(7) for trees with a white root. 


Table 10.1. Coefficients bg(7) of the modified equation for symplectic Euler (10.7) 


人 | 


br) | 1 1/2 -1l/2 1/6 -1/3 1/6 1/3 -1/6 -1/6 1/3 














We know from Theorem 3.1 that the modified differential equation (10.4) of a 
Symplectic method applied to a Hamiltonian System 


is again Hamiltonian. 


Theorem 10.4. yxppose 1pat jor al1l separaple Pammailionians 丈 (pq) = 了 (D) 十 
D(a) 1pe 1rodijiead vector jield (70. 和 也 1runcated aper an arpitrary power of 几 人 8 
(Localiy) Famatzlto7ia11. 7jem We jave 


bl(wovw) 十 pvoz) = 0 ucETP VETP (10.9) 


Jor tyeey Weye 11ei8Hhpoxrig ye1ticesy ave dijjerent coloxry. 
克 寺 有 (iocal1y) aazltomnian jor al 瑟 (Pp,q)j, ten (70.9) polds 1jorallu eE 7TP， 
UE 7 ad additiomally We jave 


b(7T) zs idependent oftpe coloxr ofthe Joof of7T E 7 (10.10) 


不 让 有 (localy) Famailonian Jor 1 五 (9g) = io7Cp 二 cp 十 DC (ww 
Sy11I1111E117ic IC11IX C) 1 态 e1l We Pave 


bloot) 二 buoo)=0，buoov) 一 puoo =0 由 VE TIN (10.11) 
(See gect 1 7AITJortjpe dejiaition orTNpn adoo 示 











The proof is the Same as for Theorem 9.3 and therefore omitted. 
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IX.10.2 Elementary Hamiltonians 


We have already seen in Example 3.4 that the modified Hamiltonian of the Sym- 
plectic Euler method is composed of expressions Such as 媚 p 万 v， 感 pp( 互 五)， 
万 pg( 克 本 ), etc. These will play the role of elementary Hamiltonians for parti- 
tioned methods. In the following definition, the elementary differentials 严 (7)(2,9) 
correspond to the partitioned system jp, 9q) = 一 瑟 v(p,q),9(D,9) = 万 (D,9). 


Definition 10.5. For a given function 万 :万 一 了 (with open DC 了 R2xR9dand 
forr E 7P we define the elememtiary 再 aailtonian 万 (7T) :万 一 及 by 


瑟 (*)(,9) = 五 (o)Ww,9q) = 五,9) 


0 万 (D,9) 
瑟 (7)(,9) 二 一 (公国 Fo 人 
Where7T 三 [Urol Up or7 三 [wuUWmy ol .)Ul， with trees 


2 所 3 and Wi E 直 
Examples of elementary Hamiltonians are 
万 (= 瓦 丽 P)= 也 肋 
五 (V) 三 五 pp 万)， 刀 (\W) 所 一 五 pg( 瑟 w 瑟 ph)， 刀 () = 瑟 oo( 瑟 五 p) 


We notice that, in contrast to Sect. IX.9.2, non-vanishing elementary Hamiltonians 
exist for trees with two vertices. 


Lemma 10.6， Eee1tfa7y 瓦 G112z11omia11y Sa1isj 
五 (uov)(p,9q) 十 瓦 (ov(pq) =0 JorueTP andve7P，(10.12) 
G11C 1jey do 1of depe1d o1 1Pe coloxr of ipe 700 


Prooj The independence of the colour of the root is by definition，and formula 
(10.12) ls proved in the Same way as the statement of Lemma 9.0. 














The conditions (10.9) and (10.10) define relations between the coeffcients (7) 
of a Hamiltonian Vector field (10.4). The previous lemma shows analogous relations 
between elementary Hamiltonians. This motivates the conslideration of the following 
equivalence relation on 7 己 (Hairer 1994). 


Definition 10.7. We denote by ~ the Smallest eqgwivale1ice elation on 7 which 
Satisfies the two properties 


e 人、~u ivandvwareidentical with the exception of the colour of the root; 


e ouUou forvweE7Pmandvc755 
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二 


Fig. 10.1. Groups of equivalent trees of orders up to three 


Equivalent trees of orders up to three are grouped together in Fig. 10.1. We can 
change the colour of the root, and we can move the root to a nelghbouring vertex 下 
it has the opposite colour. 

JIn the case of separable Hamiltonians, one has to consider only trees for which 
neighbouring vertices have different colours. This impjlies that the first condition of 
Deftinition 10.7 is empty. The Second condition means that the root can be moved ar- 
bitrarily in the tree without changing the equivalence class. For this Special Situation， 
equivalence classes have been consldered already by Abia 多 Sanz-Serna (1993) and 
are named “bicolour (unrooted) treeS”. 


Similar to (9.14) we Select representatives from the equivalence class as follows: 
we fx a total ordering on the Set 7 that ( respects the number of vertices, and 
(D) ls Such that no tree ls between trees that differ only in the colour of the root. The 
ordering of Fig. 10.1 is Such a possible choice. We then define 

丰 7 cannot be written as 了 一人 ovwwith2 <V， 
4 区 } 臣 0 also not 过 the colour of the root is changed. } 





(10.13) 
Wefurtherlet 7 玉 =TPEnTITP and7TP =TPnTP 
Lemma 10.8. Foratree7TE7TP*weHpave 
OF(7) (-D“*(0g) 
于 至 一 (0 
殉 (P, 9) a(7) 5 (9)(P, 9)， 
0~T;geETP。 (10.14) 
五 (7T) (D“*(rg) 





2 F(OOg) 


0~T;,OETP， 5(O 


Were K(T,0) 58 tpe 71iper ofyoof cjpa1ages 妨 atf ad1e 11ecessay 10 0D1a 了 0 oma 7T. 











The proof is the Same as for Lemma 9.7 and therefore omitted. 





Weare now able to give the main result of this section. 


Theorem 10.9. Co7zsider da 7PU1aierical 1ietpod 1at ca1z pe Wiitte1l ad8 Gd P-yeriey 
(70.2) ad 1at 1 Sy1Dlectic Jor eve1y Famatlionaian (70.8) 716 1iodijied dijheren- 
11QL eqgUa1io1 13 态 e1l Ga1111l1o12ia10 Wi17 


互 (2， dg) 亏 : 五 1(D， dg) 本 彤 石 2(D， 9) 未 1 五 3(D， 9) 下 和 ) 
WHeye 


有 pg9- 寺 anwgh 00.5 
TETPY ,|7|=7 


a11d 1pe coejjicienfsy b(7T) ae tbose of 7Tjeorem 710.2. Notice 1pat 万 j(D,9) Jom 
(70.75) 18 idependemt ofwpetper we Su overtreesT or 75 


Se 
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Prooj This is the same as for Theorem 9.8. 





Ithe method (10.2) is known to be Symplectic for separable Hamiltonians only， 
and ifit is applied to (z,9) =(D) 十 UV(d),the statement of Theorem 10.9 is stil 
valid. In this situation 瓦 (7)(p,d) vanishes 这 a vertex of r has sons with different 
colour (it then contains a factor 已 po = 0). 


了 xample 10.10. Consider the 2-stage Lobatto IIA - HIB pair (cf. Table 工 .2.1)， 
which is the natural extension of the Stormer-Verlet Scheme to non-Separable prob- 
lems. We compnute the coefficients ao(7) from Theorem II.2.4, and p(7) from The- 
orem 10.2. The result is given in Table 10.2. Notice that w(7) and (7) are both 
independent of the colour of the root. Theorem 10.9 then yields 


区 六 2 
豆 二 下 卡 末 (G 感 一 媚 ov 万 2 十 二 区 (10.160) 


for the modified Hamiltonian. Since the method is Symmetric， 万 is in even Powers 
of 凡 . The next non-vanishing term requires the consideration of trees up to order 5. 
Table 10.2. Coefficients wa(7) and bg(7) for the Stormer-Verlet Scheme (Table II.2.1) 
可 E 要 本 本 本 克 的 本 
ar) | 1 12 12 1/2 1/4 1l/4 1/4 1/4 0 1/4 
br) |1 0 0 16 -12 -ll2 LU/12 1/12 -1/6 1/12 

















Remark 9.9, the characterization of Symplectic vector fields (10.4), and the re- 
Sults of Sect. JIX.9.4 can be extended to the case of (partitioned) P-Series. We Te- 
nounce of giving all the details here. 


TIX.11 Exercises 


1. Change the Maple program of Example 1.1 in such a way that the modified 
equations for the implicit Euler method, the implicit midpoint rule, or the trape- 
Zoidal rule are obtained. Observe that for Symmetric methods one gets expan- 
Sions in even powers of 妨 . 

2. Write a Short Maple program which, for Simple methods Such as the Symplec- 
tic Euler method,， compnutes Some terms of the modiftied equation for a two- 
dimensional system = jp,d,d = 9(p,q). Check the modified equations of 
Example 1.3. 

3. Prove that the modified equation of the Stormer-Verlet Scheme ([L.1.15) applied 
to 了 = 9g(V) is a second order differential equation of the form y = gz(7 切 
with initial values given by (0) = yo and (0) such that ZJ = 妨 holds. 


11. 


12. 


13. 
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届 L Taylor expansion shows that for a smooth function V( 轨 Satisfying V( 三 
Zn We have 
(1+ 入 克 届 二 有 4 宇 5 JJzD = 9(7) 
12 360 ， 
where D represents differentiation with respect to time. 
Warming. In general we do not have that Vy(t 如 ) 三 加， 


. Prove that for p-reversible differential equations the elementary differentials 


Satisfy 

Fr)(g =(-D7PF(r)()， 
Use this to give an alternative proof of Theorem 2.3 for the case that the method 
ls Symmetric and can be expressed as a 忆 -Series. 


. Find a first integral of the truncated modified equation for the Symplectic Euler 


method and the Lotka-Volterra problem (Example 1.3). 

ZL With the transformation D = exp 已 , dg = exp QQ you will get a Hamil- 
tonian System. 

Resul 7T(D,d) = TD,9) 一 P((P 二 do) 一 8 一 100+2np+8hm d) /4. 


(Field w NUjhoft 2003). Apply the Symplectic Euler method to the System with 


Hamiltonian 妃 (2,9) = In(a 十 切 十 In( 十 9q). Compute the modified Hamil- 
tonian and prove that the series converges for sufficiently Small step Sizes. 
万 The method conserves exactly 7(D,q) = (十 D)(8G 十 四 .Find linear two- 
term recursions for {Dn} and {qn}, and use the ideas of Example 1.4. ReszL 





镶 了 90 一 
瓦 (D,g) 三 瑟 (P, 9) 
全 有 (天 十 ]) 
.Compute Ooc(7) forthetree7 = |[[7j,7] of order 4. 


. For the implicit midpoint rule compnute the coefficients a(7) of the expansion 


(9.1), and also a few coefficients b(7) of the modified equation. 
Resut al7) = 27 5。) = 1 7) = 0 57) = alr) -1/7y(r) for 
|r| = 3. 


. Check the formulas of Iable 9.1. 
.Consider a differential equation y = jy) with a divergence-free vector field， 


and apply a volume-preserving integrator. Show that every truncation of the 
modified equation has again a divergence-free vector field. 

ZL Adapt the proof by induction of Theorems 2.3 and 3.1. 

Consider explicit 2-stage Runge-Kutta methods of order 2, applied to the pen- 
dulum problem d = 2,D = 一 Sing. With the help of Exercise 2 compnute 
户 (p,dq) ofthe modified differential equation. Is there a choice of the free para- 
meter cz, Such that 户 (2, 9q) is a Hamiltonian vector field? 

Find at least two linear transformations po for which the Kepler problem (1.2.2)， 
written as a first order System, is DO-Teversible. 

Consider the 天 epler problem (1.2.2), written as a Hamiltonian System ([.1.10). 
Find constants WM and 尺 such that (7.2) holds for all (p, 9) E 了 月 4 satisfying 


lpl<2 and 08<lcll<1.2. 


388 IX. Backward Error Analysis and Structure Preservation 


14. (McLachlan 儿 Zanna 2005). Consider the RATTLE method (Algorithm VI.5.1) 
applied to the Euler equations (VIH.3.10) of the free rigid body，written as 
1 = 帮 力 . Prove that the modified differential equation is of the form 


8 三 (1 十 j2sa( 人 十 研 sa(0) 十 …) (11.D 


where the scalar functions sk(V) depend on V only via the Casimir function 
C(g= 好 十 邮 十 好 andthe Hamiltonian 瓦 () = 寺 (8/ 五 十 妇 / 素 十 好 /3). 
Consequently, all sk(y) are constant along solutions of the Euler equations. 
ZL Since C(V) and 万 (y) are exactly conserved by the numerical method 
(See Sect. VJIL.3.3), the modified equation ls a time transformation of the origi- 
nal System. The special form of the functions sx(y) follows from the fact that 
RATTLE ls a Poisson integrator (Theorem VJI.3.11) and from a transformation 
to canonical form as in Theorem 3.3. 

15. (Murua 1999). Let @j(V) = 下 (a,y) be given by a B-series and denote with 
b(T) the coefficients of the corresponding modified differential equation，cf. 
formula (9.4). Prove that the coefficients of the mth iterate GZ(V) 三 妃 (o" ,yy) 
Satisfy 

an(7) 王 m br) 十 mn2c(rm) for 7 ET 了， 


where c(7T,m) is apolynomial of degree |r| 一 2 in 7. 
ZE This follows from the Taylor series V() = WO0) 十 1 (0) 十 .forthe 
solution of the modiftied differential equation. 

16， With the help of Exercise 13, give an alternative proof of Theorem 9.3. 

页 人 开刀 (ay) is symplectic, also 妃 (ao",V) is symplectic and its coeffGcients 
thus Satisfy (VI.7.4). 

17. (Murua 1997). Find a one-to-one correspondence between the equivalence 
classes of 7 忆 (corresponding to ~ of Definition 10.7) and oziemted jee treeys 
(ie., trees without a distinguished vertex (Toot), but with oriented edges),， see 
Fig. 11.1. 


. 人 
OA 


Fig. 11.1. Oriented free trees up to order four 


Chapter 入 . 
Hamiltonian Perturbation Theory and 
Symplectic Integrators 


Perturbation theory is in fact an outgrowth of the necessity to determine 
the orbits with ever greater accuracy. This problem can be solved today， 
but in what is for the theoretician a rather disappointing way. With mod- 
ern calculating machines, one is now able to compnute directly results even 
Imore accurately than those provided by perturbation theory. 

(J. Moser 1978) 


... allows computer prediction of planetary positions far more accurate 
(by brute computation) than anything provided by classical perturbation 
theory. In a very real sense, one of the most exhalted of human endeavors， 
going back to the priests of Babylon and before, has been taken over by 
the machine. (S. Sternberg 1909) 


In this chapter we Study the long-ttme behaviour of symplectic integrators, combin- 
ing backward error analysis and the perturbation theory of integrable Hamiltonian 
Systems. 







backward 
10IOCi1ied 二 erTOL anal，_ numerical 
DTODle11 Solution 
numerical | 
method error ? 
Y 
original exXact 
iu 3 芭 林 
problem Solution 
perturbation 个 5 
theory 人 7 
GDD1OXI11G1e approximate 
一 -一 
DTOD1e11 Solution 


During the 18th and 19th centuries, Scientists struggled for the integration of com- 
plicated problems of dynamics, with the main aim of solving them analytically by 
“quadrature” But only few problems could be treated Successfully im this way. In 
cases Where the original problem could not be Solved, much effort was put into Te- 
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placing it by an integrable approximzate pfople, by using and developing perturba- 
tion theory. Thereby, a rich arsenal of very ingenuous theories has been discovered 
Since the 19th century. 

In the 1960s and 1970s, the enormous progress of “calculating machines”and 
numerical software allowed many of the original problems to be solved with extreme 
accuracy, So that for the first tme numerical integration methods Superseded analyt- 
ical perturbation methods in the computations of celestial mechanics (See the above 
citations). Since then, the further increase in computing Speed has allowed problems 
to be treated on larger and larger time Scales, where huge amounts of errors are ac- 
cumulated and need to be understood and controlled. In the Spirit of backward error 
analysis, these numerical errors are interpreted as those of a 11odijed Prople, for 
the study of which perturbation theory is once again the appropriate tool. 


X.1 Completely Integrable Hamiltonian Systems 


JIntegrable Hamiltonian Systems were originally of interest because their equations 
of motion can be Solved analytically. Their interest in the present context lies in the 
fact that their fow is Simply uniform motion on a Cartesian product of circles and 
straight lines in Suitable coordinates, and that many physical Systems can be viewed 
as perturbations of integrable Systems. 


和 X.1.1 Local Integration by Quadrature 


M. Liouville a fait voir qu'il fallait que toutes les combinaisons (a, G) 
des intEgrales trouvkes fussent nulles. (上 .Bour 1859) 


One of the great dreams of 18th and 19th century analytical mechanics Was to Solve 
the equations of motion of mechanical Systems by “quadrature ,that ls, using only 
evaluations and inversions of functions and calculating integrals of known functions. 
In this Spirit Newton's (1687) equations of motion of Kepler s two-body problem 
were Solved by Joh. Bernoulli (1710) and Newton (1713)，see Sect.1.2.2. Eulers 
(1760) solution of the problem of the attraction of a particle by two fixed centres， 
and Lagrange's (1766) study of motion of a particle in a field with one attracting 
centre and under an additional constant force were among the important achieve- 
ments of the 18th century. The three-body problem, however Tesisted all efforts 
aiming at an integration by quadrature, and though it continued to do so, this prob- 
lem Spurred the development of extremely useful mathematical theories of a much 
wider scope throughout the 19th century, from Poisson to Poincare via Hamilton, Ja- 
cobl, Liouville, to name but a few of the most eminent mathematicians contributing 
to analytical mechanics. 
Consider the Hamiltonian System 


9 8 末 
了 人 (1.1) 
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with d degrees of freedom: (p, q) E 了 月 4 x 及 4. We try to find a symplectic transfor- 
mation (p, 9q) 一 (Z,y), such that the system has a more amenable form in the new 
coordinates. In particular, this is the case 夺 the Hamiltonian expressed in the new 
Variables， 


刀 (,g) 三 天 (Z)， (1.2) 
does not depend on V. Since 呈 三 0, the transformed System then becomes (re- 


call the conservation of the Hamiltonian form of the differential equations under 
Symplectic transformations, Theorem VI.2.8) 


zz-0，7=wlz)， (1.3) 
with w(zZ) 三 荣 (z). This is readily integrated: 
2Z 的 =Zo0，g 人 的 三 加 十 wzo)t 


As we recall from Sect.VI.$, a Symplectic transformation (D,q) 上 (Z,V) can be 
constructed via a ge1eratizzg jxnction 9(zZ,q) by the equations 


O9 DO9 


If(po,ao) and (zo,yo) arerelatedby (1.4), andi925/0zpd is invertible at (zo, qo)， 
then the equations (1.4) define a symplectic transformation between neighbourhoods 
of (20， da0) and (Z0， 20). 

The equation (1.2) together with the Second equation of (1.4) give a partial dif- 
ferential equation for 5, the 瓦 aziz1to1 一 /acopi eqguatiom 


刀剑 人 0) = Ka 


IfS(z,d) is asolution of such an equation (for some function 开 ), then (1.3) shows 
that zi 三 玉 (D,q)G=1. ,aq)asgivenimplicitly by the second equation of (1.4)， 
are first integrals of the Hamiltonian System (1.1). Moreover, these functions Fi are 
170 170VOUUNOo1, Which means that their Poisson brackets vanish pairwise: 

Q . 


) 


天 站 三 0 了 7 三 
{ 了 J 


This is an immediate consequence of the definition { 书 G} = VFTJ-IVG ofthe 
Poisson bracket and of the symplecticity of the transformation (the left upper block 
of .J-1is 0). 

Conversely, it was realized by Bour (1855) and Liouville (1835) that a Hamil- 
tonian System having da first integrals in involution can /ocally be transformed to the 
form (1.3) by “quadrature”. This observation is based on the following completion 
Tesult and its proof. 


392 X. Hamiltonian Perturbation Theory and Symplectic Integrators 


Lemma 1.1 (Liouville Lemma). Ze 斑 ，... ,Eape s1ioo 妨 Jea1-VaLued Jiactio1zy， 
dejiped 六 a 1eigjppborrpood of (po,qo) E 形 4 x 了 9. Suppose tpat 1Pese Jinctions 
C1e 胡 ivO1Ution (Le., G11 Poiysom Drackety { 有 7 一 0) da1d tpat 加 eir g1adie1aty 
C1e 1iearly idepe1tdentar(Do,90). 7T1jem 1 加 ee exist S1100 太 Jpctio1g G1 ,Gu， 
dejipzed om some 1eigjpoxrpood of (po,q0)， SucP tat 


(下 FoG1D Gu (D9 呈 (2 NaSy1iplectic tra1zs1o7aatio7. 


Proof Let 焉 三 (, 区) The linear independence ofthe gradients Vim- 
plies that there are d columns of the dg x 2d Jacobian OFV/a(p,9) that form an invert- 
ible dxawsubmatrix. After some suitable symplectic transformations (See Exercise ]) 
we Imay asSume Without loss of generality that zy = OF/Op is invertible. By the 
implicit function theorem, we can then locally solve z = 斑 (P, q) for D: 


2 = P(z,g) with partial derivatives 已 =， 肠 = 一 瑟瑟 . 
The condition that the Am; are in Involution, reads in matrix notation 
伙 5 
Zpzr 一 ct 一 0 . 


Multiplying this equation with Zoo 1 from the left and with 攻关 7 from the right we 
obtain 
人 
一 已 十 瑟 =0， 


So that P = OP/ao is symmetric. By the Integrability Lemma VI2.7,，P(z, 9) 
is thus locally the gradient with ne ， d of Some 9(z,d) (which is 
constructed by quadrature). Moreover, 2 蕊 六 交 = = zz 三 2 1 is invertible. The equa- 
tions (1.4) define a symplectic transformation (P, q) 哺 > 人 ,2),， and by construction 


2 三 下 (0D,9). 


I, in a Hamiltonian System with w degrees of freedom，we can find ac inde- 
pendent first integrals in involution 妃 = 辣 ,z2 .Ed then Lemma 1.1 yields 
a Symplectic change of coordinates,，constructed by quadrature，which transforms 
(1.1D) 17ocaliy to (1.2) with 开 (Z1) ,7Zd) 一 21. 














了 xample 1.2. Consider the Hamiltonian of motion in a central field， 


刀 =3(o1 二 D2) 十 V(r) for 7 一 qz 十 9q2 ， 


with a potential V(7) that is defined and smooth for ” > 0. The Kepler problem 
corresponds to the Special case V(r) = 一 1/r, and the perturbed Kepler problem to 
Vr) = -1 人 r -NA/(3r3). Changing to polar coordinates (see Example VI.5.2) 


gf7rcosy Dr COS 0 Sin % D1 (L.5) 
02 rsino / Do 一 Sin% 7COS Da / 


thlis becomes 


X.1 Completely Integrable Hamiltonian Systemas 393 


1 2 
媚 (or pe;m p) 二 了 ( 十 才 ) 十 Y(m) 

The system has the angular momentum 二 = po as a first integral, since 已 does 
not depend on p. Clearly, { 万 ,了 } = 0 everywhere. The gradients of 妃 and 也 
are linearly independent unless both pr = 0 and p2 = 73W'( 站 .By inserting 22 一 
27( 瓦 -=-V(m) andeliminatingrthis becomes acondition ofthe form aw( 万 , 卫 ) = 0， 
which for the Kepler problem reads explicitly 二 2(1 十 2 万 72) = 0. The conditions 
of Lemma 1.1 are thus satisfied on the domain 


三 {(Dr Deym P) ; 7 > 0， Q 人 (已 , 卫 ) 和 0} 


The equations zl = 五 三 去 22 十 D2pr2) 十 Y(m),za = 二 = Po can be solved for 











入 三 土 V2( 瑟 -TY( 门 一 Z2]r72， po = 工 ， 
and pr = 05/6r,po = 95/0p with 








SUPn 风 = or 寺 厂 VD 二 do， 


The conjugate variables are 
0O9 人 
册 = 呈 = 圭 dp ， 
5 五 “jhVvV55 这 四 元 万 
各 区 全 
D9 加 /p 请 
ro V2( 万 一 TO)) 一 72]02 


This defines (locally) the transformation (Pr, po op) 上 (21;,22),91,W2). In these 
variables, the equations of motion read Zi1 三 0, Za 三 0,V1 三 1, ya2 三 0. Over any 
time interval where pr(t) does not change sign, solutions therefore satisfy 











(1.0) 


























7(tl) 1 
夺 1 E to 一 二 QD ) 
rtto) V2( 吾 一 YUO)) 一 Z27]02 


RE ZL] 
的 -同人 7 人 4 


(1.7) 











X.1.2 Completely Integrable Systems 


Lemma 1.1 appears as a powerful tool for an explicit solution by quadrature. How- 
ever, because of its purely local nature this lemma does not tell us anything about 
the dynamics of the System. This was not a concern at Liouville's time, but the first 
Tigorous non-integrability results by PoincareE (1892) put a definite end to the hope 
of being eventually able to construct explicit analytic solutions of most equations of 
motion by quadrature, and shifted the interest to understanding the glopa/, qualita- 
tive behaviour of dynamical Systems. 

Lemma 1.1 can be globalized by a procedure Similar to analytic continuation 下 
the conditions of the following definition are Satisfied. 
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Definition 1.3. A Hamiltonian system with Hamiltonian 媚 : AM 一 及 (CCM an open 
Subset of 及 4 x 玲 四 ) is called comzpletezy 加 tegraple 计 there exist smooth functions 
下 三 万 ,FEM 一 下 withthefolowing properties: 
1]) 媚 .Foareininvolution (ie.,all {Ei,z =0) on7I. 
2) The gradients of 说 ,... ,zaarelnearly Independent at every point of AM . 
3) The solution trajectories of the Hamiltonian Systems with Hamiltonian 忆 ; (z 一 
1,...,a) exist for all times and remain in AM. 


Obviously, all the Hamiltonian Systems with Hamiltonian Pi; (2 三 1 ...,a) are 
then completely integrable, and So there will be no mathematical reason to further 
distinguish 妃 = 三. Wenote that condition (1) of Definition 1.3 implies that all 万 7 
are first integrals of the Hamiltonian System with Hamiltonian Pr, and that the fows 
o 半 of these Hamiltonian Systems commnute: o 半 O 0 ] -- 0 ] 。 o 半 for al z,7 and 
all 上 seE 开 ;see Lemma VII.3.2. 


Forz = (zi) E 及 4 we define the level set 


Theorem 1.4. SUPPpose 11at 下 EM 一 阴 satsA 1je comditiony of De1jizi- 
11011 7.3. 4SsU111e 1101 Ma 718 co11lected (ad 1101-e1D1y) JorallzQ1eigPporpood 
ozZ0 E 下 4 77jpem om so1re meigjhpoxrjpood 已 of zo, 1Ppere exisfs da sy1aplectic and 
SUHTectiVve 11IGDD1108 


e :已 xR2 Us 人 (es 


ZE 已 
1pat iearizey joral7 三 1 .dt1jpejiow o 装 Of1pe syste11a Witp 瑟 a11atltiomia1z 
六 (9g) = el, 力 ，ten (9g) =elz,y+tei， (1.9) 
Wjere ei 一 (0 .10) 和 六 太 e ivectoror 了 4 


Since e is Symplectic, e ls a local diffteomorphism. Its local inverse ls a trans- 
formation as constructed in Lemma 1.1. However, (2, q) can have countably many 
discretely lying pre-images (z,V)， so that e-1 becomes a multi-valued function 
The situation is analogous to that of the complex exponential and logarithm. The 
following example illustrates that this analogy is not inclidental. 


Example 1.S5，Consider the harmonic oscillator ie., dg = 1 and 瑟 (p,d) = 二 (D2 十 
d2). For z = 苯 2, wehave e(Z,y) = (rcosyrsiny). 


Pioofof7jpeorem 1.4 Wefix (po,qo) < Ma,andinaneighbourhood U of (po, ao) 
we consider a Symplectic transformation 


《一 (下 0G1 Gd) : (D,9) 王 (7Z,9) 


as constructed in Lemma 1.1. We have (po, qo) = (zo,yo) where we may assume 
2 = 0.To everyv = (ww) E 陈 4 we associate the Hamiltonian 
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Zu 一 VE 十 .十 VE 
and note that, because of the commnutativity of the fows o 半 , the fow of the System 
with Hamiltonian Zv equals 
1 df 
oo 一 op oogloo : 
In the neighbourhood UV of (po, 9qo), the system with Hamiltonian Fu, is transformed 
under the symplectic mapping to 


外 三 全 ”水 二 5 
Hence, the following diagram commnutes for (p,q) E UV and for sufficiently small tw: 


(P， 9) 人 2tv (P， dg) 


让 必 (1.10) 


(人 ”cy 二 如) 
We now construct e by extending this diagram to arbitrary tv: 


(D,9) -一 popD,9) 
人 (1D) 


(zZ,0) 全 (z,y) 


That is, we define on 妃 x 有 4 (with 已 aneighbourhood of zo on which l-I(z;,0) is 
defined) 
e(2， 2) 二 %y (1(z， 0)) ， 
For (z,V) near some fxed (Z, 人 办, we have by (1.10) withy 一 人 andinstead ofy 
and to that 
e(0 细 三 弛 (2 一 仿 ) ， 

which shows that e is Symplectic, being locally the composition of symplectic trans- 
formations. The property (1.9) is obvious from the definition of e and from the com- 


mutativity of the fows oo 加. Since l -1(z,0) E Ma and Ma is invariant under the 


fows o 辐 ， we have e(Z,V) E Mz for all (z,y). 

It remains to show that e : {Z} x 陈 4 一 Ma is surjective for every Z near 
20. Let (世人 be an arbitrary point on Mz. By assumption, there exists a path on 
Maz connecting l-1(z,0) and ( 田 四 . Moreover by (1.10) and by the compactness 
of the path, there is ab > 0 such that, for every (D,9) on this path, the mapping 
V mm wy(p;,9q) is a diffeomorphism between the ball |ly|| < 6 and a neighbour- 
hood of (P,g) on MTherefore, ( 仑 们 can be reached from l-1(z,0) by a finite 
composition of maps: 

信 习 =pm oo 人 (0) 一 eg (0)=e(z) 力 ， 


where 分 = VD 十 .. .十 Wo once again by the commnutativity of the flows o 辣 ， 
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了 lustration of the Liouville Transform. We illustrate the above construction at a 
simple example, the pendulum (IL.1.12) with Hamiltonian 万 = 22/2 -- cosdq. The 
first coordinate is z = (p,q),afirstintegral. The second coordinate Vis, following 
(1.11), the time 上 which is necessary to reach the point (2, q) from an initial line， 
which we assume at dg = 0. Then we have (Fig.1.1 lefb ap ad = qd 瑟 ul (because 





Fig. 1.1. Liouville and action-angle coordinate transforms 


of dd = 局 tandd 巨 = 瑟 pdp). Wesee again that we have area Preservation， 
because the Symplecticity of the fow preserves this property for all titmes. This 
Symplectic change of coordinates (p,9q) 一 (z,V) is ilustrated in Fig. 1.2, which 
transforms the problem (A) to a much Simpler form (B) with uniform horizontal 
Imovement. 

















































































































Fig. 1.2. Liouville and action-angle coordinates illustrated at the pendulum problem 


We are not yet completely satisfied, however,， because the orbits have periods 
0 =9( 瑟 ) which are not all the same. We therefore append a secomd transform by 
putting 0 三 加 :tt (See picture (C) in Fig.1.1 and Fig.1.2), which forces all periods 
into a Procrustean bed of length 27. Area preservation da qd0 = qd 万 dt now requires 
that 2r da = 9( 刀 ) qd 万 , which is a differential equation between a and 万 . The new 
coordinates (a,0) are the actiom-a1gle variaples and we see that they transform the 
phase space into 刀 x TI where 万 C 月 !. We again have horizontal movement, but 
this time the Speed depends on w. The general existence for completely integrable 
Systems Will be proved in Theorem 1.6 below. 
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X.1.3 Action-Angle Variables 


We Show here that, under the hypotheses of Liouville's theorem，we can 
fnd symplectic coordinates (I p) such that the first integrals FF depend 
only on and wp are angular coordinates on the torus AMf . 

(V.I. Arnold 1989, p. 279) 


Wearenow in the position to prove the main result of this Section, which establishes 
asSymplectic change of coordinates to the So-called actio7z-a1181e vaFiaples, Such that 
aq first integrals of a completely integrable System depend only on the actions, and 
the angles are defined globally mod 27 (provided the level sets of the first integrals 
are compact). This is known as the Arnold-Liouville theorem; cf. Arnold (1903， 
1989), Arnold, Kozlov 儿 Neishtadt (1997; Ch. 4, Sect. 2.1), Jost (19608). Here and 
in the following， 


T4 = 了?/27rZ4 = {(0 mod 2r,... ,0 mod27r) ; 0 ee 及 } 
denotes the standard w-dimensional torus. 


Theorem 1.6 (Arnold-Liouville Theorem). Zef , .EM 一 开 De Jst 
111eg1al1s of a co111Dletely Iteg1apDie syste1a ds 1 De1i1aition 1.3. SUDPose 1Pat 1ne 
level yet Ma (see (1.8)) ae co1ipact a11d co1t1lected Jor az 大 da711etg1porrpood 
orzZ0 E 下 4 77jpem tere are meigjhpoxrjoods 局 ofzo and or0 六 月 4 szcp tpat 


1je Jo1iowing holads: 
(人 ForeyermyZ E 了 1peevelsyetAM Ma ad-dizme1zsio11al 1o1U8 1at 13 10VG7iG1 
WU1Ger 1e jiow ofipe syste1a wii Dazailtonian 有 (三 1 dl). 


(iTjexe exis18 G Dijective Sy11Plectic 11ra12s1o7711G1IO1 
办 :万 xTa4 一 U AM CR4xR2:(a0)r (D,9) 
ZE 已 


SUC1 1Pat(FEioW)(a,0) depends oly om Q Le.， 


及 (D;9) = 态 (a) JoOr (2,9) 一 W(a,0) (一 1 .dg) 
Wi Jpctio1y 广 : 户 一下. 


The variables (oa,0) = (ai ,ad0lmod27r,...,04mod2r) are called 
QGC1ioO1-G1181e VG7IGD1eS. 


Remark 1.7. If the level sets Mz are not compact, then the proof of Theorem 1.6 
Shows that Mu is diffeomorphic to a Cartesian product of circles and Straight lines 
TA x 了 月 % for someK < d and there is a bijective symplectic transformation 
(a0) 一 (P,d) between 万 x (Te xx 及 5) and aneighbourhood LU{fAz :ze 号 
of Muz。 such that the first integrals again depend only on w. 


Remark 1.8. If the Hamiltonian is real-analytic, then the proof shows that also the 
transformation to action-angle variables ls real-analytic. 
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Proofof7Tjpeore11a 7.6. (a) We return to Theorem 1.4. For Z E 已, we consider the 
Set 


及 = 也 ER ie(z 纺 =e(z;0)} 


Since e ls locally a diffeomorphism, for every fxed yo E 7xo there exists a unique 
smooth function 7 defined on a neighbourhood of zo, such that 7(Zzo) = yo and 
1(Z) E for z near Z0. In particular 六 is a discrete subset of 及 4. By (1.9)， 
fory E 忆 wehaveelzy + = e(zu) for alvw E 陈 4. Therefore, mis a 
Subgroup of 术 4, ie., with ve 六 alsoy+wE 六 and-ye 妆 .Itthenfolows 
(See Exercise 4) that 7z is a grid, generated by < d linearly independent vectors 
i (oz ,9(z) E 4 


交 ={maigli(Z) 十 .十 TOk(Z) ;Ti ED . 


We extend g1(Z) ,gp(Z) to abasis gli(zZ), ,gd(Z) of 有 4. Then, e induces a 
diffeomorphism 


Te xRRde 一 11 





天 dd 
(0 .0 TH1 Ta) 上 c(w 下 wo+ ， 万 9 加 )， 
i 一 1 7 一 K 二 1 


HI Ma is compact then necessarily 有 一 wand M2 is atorus. The above map then 
becomes the bjjection 


T4 一 AM :0 ia 生 gi() 5 
i1 


(b) Next we show that gi(zZ) is the gradient of some function Li(Z). For nota- 
tional convenience,， we omit the Subscript ? and consider a differentiable function 9 
with 

e(2,9(Z)) = e(zZ;,0) ， ZE， 


or edquivalently， 
loelzglo)=(0， ze 


Differentiating this relation gives (with 7 the wd-dimensional identity) 


了 了 
RN 
where 4 is the Jacobian matrix of loeat (z,9g(z)). We thus have 
全 
gz 7 0 


Since lo e is a symplectic transformation， we have 47.14 = .jand hence the 
above equation reduces to 
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qz) 一 gz)=0. 
By the Integrability Lemma VI.2.7, there is a function UV such that q(Z) = YU(z). 
We may assume LU(zo) = 0. 
(c) The result of (b) allows us to extend the bijection of (a) to a Symplectic 
transformation. For this, we consider the generating function 


d 
lb 
9(z,0) 王 过 5 Li(Z) . 
With wu(z) = (Di(z) ,Da(z)),the mixed second derivative of S is 


So(z 有 = 到 we 加 = 到 (oo ou) ， 


which is invertible because of the linear independence of the 9i. The equations 


Wz(Z) 一 


065 1 059 必 4 
4 一 页 = 元 wz)， 9 
define a bijective Symplectic transformation (for Some nelghbourhood D of 0, and 
possibly with a reduced nelghbourhood 已 of zo) 


d 
8: 万 x 了 4 一 巨 xR4:(a0) mm (zy 一 (并 天 oa)) 


wherez = j(a) is the inverse map of a = 亏 V(Z). We now define 


办 =eo8:DxR? 一 U AI2 . 

ZE 咏 
By construction, this map is smooth and symplectic, and such that 户 (a) = 2; 一 
太 (p,dq) for (pp,dq) 三 坊 (a， 0). Itis surjective by Theorem 1.4. By part (a) of this 
proof, it becomes injective when the 0 are taken mod 2r, thus yielding a transfor- 
mation defined on 刀 x TY with the stated properties. 














X.1.4 Conditionally Periodic Flows 
An immediate and Important consequence of Theorem 1.0 is the following. 


Corollary 1.9. 7 1e situatioz of 7T1eore11z 1.6, co1lsider 1pe co1plete1y itegrapie 
SySte1a Wi 再 Gmailiomiazz 三 碑 . 太 妨 e action-a118ie variaples (a 0 1pe 瑟 azmi- 
1O11G1 egUGQ1iO1S Deco111e 


中 ==0，b=uwio (= ,dg) 
wii ui(a) =OK/bai(al, wpereK(a) = 三 瑟 (D,q)Jjor(p,d) =Wp(aO0)， 
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The fow of a differential System 
0=w， ww 一 (wi) E 了 4 


on the torus Ta is called comdifiomral1y per- 
odic with fredguencies wi. The flow is peri- 
odic if there exist integers ji Such that for 
any two frequencies the relation wi/wj 三 
应 /1 holds. Otherwise, the fow is called 
GdUaSI-periodic. In particular，the jlatter oc- 
cuUIS when the fredquencies are rationally Independent，oT 11o1-7eso1a11t the only 
integers /ii with Fiwl 十 .十 Kowd = 0areki =. .= 1 三 0.For non- 
Tesonant frequencies, it 1s well known (See Arnold (1989), p.287) that every tra- 
jectory {0( :LE 了 } is dense on the torus Td and uniformly distributed. 





了 xample 1.10， We take up again the example of motion in a central feld,，Exam- 
ple 1.2. For given 如 and 忆 , we now assume that 


位 >0;2( 瑟 一 Y( 站 ) 一 到 /2 > 0 = [ro7i] 


is a non-empty interval and the derivatives of 2( 妃 - Y( 门 ) - Z2/r2 are non- 
vanishing at 7ro, 71. By (1.7), the motion from yo to 7rl and back again takes a time 
了 and runs through an angle which are given by 














人 1 1 
0 1.12 
由 了 

的 ZL/O 
了 一 2 ao0O . 1.13 
、 VOD ee 


Note that ro, 71, ,2 are functions of 妃 and 二 .The solution is periodic 让 史 is 
arational multiple of 27r. This occurs for the Kepler problem, where 到 = 27 and 
where 了 全 = 2r/(-2 万 )3/2 (for 万 < 0) depends only on 万 ; see Exercise LI.5. 

We now construct action-angle variables and compnute the frequencies of the 
System. We begin by constructing the mapping e(z,y) as defined by (1.11) for the 
variables Z = (Z1,7Z2) = (也 ,站 andy = (2) of (1.60). For a given (zy)， 
We consider (2Z,0) and we fx (p,9) with p = (pr,Dpo) andd 三 (mwp) such that 
L(p,9) = (Z,0),e.g., by choosingr = 7o,p = 三 0,D 三 0,Dpo = 工 .The mapping 
e(z,V) is defined by the fow at timet = 1 corresponding to the Hamiltonian 








属 = 纪 +o 工 = 六 (二 十 por 十 TD) 十 gpe ， 


ij.e., by the solution at 上 一 工 of 


2 


D2 ， . 
rr 一 一 0 T 矿 科 ) 一 0 
六 见 二 一 gj 本 全 


. 万 
六 一 WiDr ， 全 


X.1 Completely Integrable Hamiltonian Systemas 401 


If we denote the ftow of the original system with Hamiltonian 如 (pr,poy7,2) by 
Di then we have 


e(Z;V) 二 %w (0， 也 ,70,0) 宙 (0,0， 0,22) 


with the last component taken modulo 27r. Hence，the values of Y Satisfying 
e(Z,V) = e(Z,0) are 
三 mt19I(Z) 十 70292(Z) 


with integers 701,702 and 


(2 人 


We know from the proof of Theorem 1.6 that 9l and 92 are the gradients of functions 
[7( 万 , 卫 ) and UV2( 隐 ,也 ),respectively. Clearly, Da = 27 克 . The expression for Li is 
less explicit. With the construction of the Integrability Lemma VI.2.7, this function 
is obtained by quadrature, in a neighbourhood of (万 o,Zo), as 


1 
DZD= (可 TCRATe- +s-IO)- 
0 
(Fo0B(PF+s( 万 - 瓦 ), Fo+s( 一 Lo))qs ， 
(For the Kepler problem, 了 = 27r/( 一 2 万 )3/2, 有 = 0 mod 2r, and hence 三 
27/V--2 万 .) For the action variables we thus obtain 


工 
呈 均 三 (三 力 ， a2 一 了 . 


The angle variables are given byy = 去 (0191 十 02g2),ie.， 


2 克 2 
0 一 过 = 0。 一 2 1.1$ 
1 二 纹 万 ， 2 多 十 太志 (1.15) 


Writing the total energy 妃 = 开 (al, 卫 ) 这 ai is given by the above formula we 
obtain, by differentiation of the identity 2rail = Li (KK(al 万 ,万 )， 


maK 60 90 


2 丰 王 一- 一 全 es 
2， 亲 7 7 


and hence the frequencies 


芝兰 。 (1.16) 


402 X. Hamiltonian Perturbation Theory and Symplectic Integrators 


X.1.S The Toda Lattice - an Integrable System 


Our method is based on the realization that the Toda lattice belongs to 
a class of evolution equations which can be studied, and in some cases 
Solved, by utilization of a certain associated eigenvalue problem. 

(H. Flaschka 1974) 


Classical examples of integrable Systems from mechanics include 天 eplers problem 
(Newton 1687/1713, Joh. Bernoulli 1710), the planar motion of a point mass at- 
tracted by two fxed centres (Euler 1760), Keplers problem in a homogeneous force 
field (Lagrange 1766 solved this as the limit of the previous problem when one cen- 
tre ls at infinity), various Spinning tops (Euler 1758b, Lagrange 1788, KovalevSskaya 
1889, Goryachev 18995 and Chaplygin 1901), a number of integrable cases of the 
motion of a riglid body in a fuid, the motion of point vortices in the plane. We refer 
to Arnold, Kozlov 儿 Neishtadt (1997) and Kozlov (1983) for interesting accounts 
of these problems and for further references. 

Here we conslder the celebrated example of the IJoda lattice which was the start- 
ing point for a huge amount of work on integrable Systemas in the last few decades， 
with fascinating relationships to Soliton theory in partial differential equations (most 
notably the 天 orteweg-de Vries equation) and to eligenvalue algorithms of Numerical 
Analysis; See Deift (1996) for an account of these developments. 

The Joda lattice (or chain) ls a system of particles on a line interacting pairwise 
with exponential forces. Such Systems were studied by Toda (1970) as discrete mod- 
els for nonlinear wave propagation. The motion ls determined by the Hamiltonian 


Hg = (3 有 +exmp(w 一 or) (LI7) 
左 一 工 


Two types of boundary conditions have found particular attention in the literature: 

(D periodic boundary conditions: qn+1 三 91; 

() put formally w+1 = 十 co, so that the term exp(dn 一 qn+1) does not appear. 
It was found by He6non, Flaschka and independently Manakov in 1974 that the pe- 
riodic Toda System is integrable. Moser (19735) then gave a detailed Study of the 
non-periodic case (1). 

Flaschka (1974) introduced new varlables 

oK 一 一 3， 了 肽 一 二 exp(3(dk 一 gp+1)) . 

(Take b = 0 in case (ii)). Along a solution (六 ,qd() of the Toda system, the 
corresponding functions (a( 加 ,ob( 芭 ) satisfy the differential equations 


拔 一 2( 辽 一 如 -1 ， 基 王 姑 (akHl 一 on) 


(with an+1 = al in case (人 ,0 三 0 in case (ii)). With the matrices 
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a1 0 bn 

b1 CQ2 D2 0 
po aa 03 

了 三 ) 

0 bn 一 2 Q7m 一 1 bn 王 

pn pn 1 Qm 

0 01 一 bn 

--b01 0 0 0 
-bj 0 有史 

刀 = BOD) = . . ， 
0 一 bn 一 2 0 pn -1 
bn 一 bn 1 0 


the differential equations can be written in the Zax pair form 
天 二 B5D-7B. (1.18) 


This System has an isospectral pow, that is, along any solution 卫 ( 芒 of (1.18) the 
eigenvalues do not depend on 已 see Lemma IV.3.4. The eigenvalues 和 1,...,》Xn of 
了 上 are therefore first integrals of the Ioda System. They are Independent and turn out 
to be in involution, in a neighbourhood of every point where the Xi are all differ- 
ent; See Exercise 6. Hence, the TIoda lattice is a completely integrable System. Its 
也 amiltonian can be written as 


万 = >》 (2 吗 十 4 如 ) = 2trace72 一 2y、 2 
三 小 


5 


We conclude this section with a numerical example for the periodic TIoda lattice. 
We choose 寻 =3andthe initial conditions pl 三 一 1.5, Dp2 = 1 53 = 0.5 and 
d1 1， da2 2, 03 一 1. We apply to the System with Hamiltonian (1.17) the 
Symplectic Second-order Stormer-Verlet method and the non-Symplectic classical 
fourth-order Runge-Kutta method with two different step Sizes. The left Pictures of 
Fig.1.3 show thenumerical approximations to the eigenvalues, and the right pictures 
the deviations of the eigenvalues 和 Xl, X2, 和 3 along the numerical solution from their 
initial values. Clearly, the eligenvalues are not invariants of the numerical Schemes. 
However Fig.1.3 iustrates that the eigenvalues along the numerical solution re- 
main close to their correct values over very long time intervals for the Symplectic 
method, whereas they drift off for the non-symplectic method. 

An explanation of the long-time near-preservation of the first integrals of com- 
pletely integrable Systems by Symplectic methods will be given in the following 
Sections, using backward error analysis and the perturbation theory for integrable 
Hamiltonian Systems. 
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Stormer/Verlet method 














| 
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| 1 | 
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classical Runge-Kutta method of order 4 
































5 -04 
1 .02 上 
1 T 上 T 上 -一 t 
5000 10000 15000 0 0 200 
< .02 人 
时 .04 上 











Fig. 1.3. Numerically obtained eigenvalues (left pictures) and errors in the eigenvalues (Tight 
pictures) for the step Sizes 玉 三 0.1 (dotted) and 玉 三 0.05 (solid line) 


X.2 Transformations in the Perturbation Theory 
for Integrable Systems 


Probleme g6nkral de la Dynamique. Nous Sommes donc conduit inous 
proposer le probleme suivant: Etudier les 6quations canoniques 


ct 了 avi dt Qi 
en SuUpposant que la fonction F peut se d6velopper suivant les puissances 
dun parametre tres petit / de la maniere Suivante: 
=FoT+HEI 二 H2F2 十. 


en Supposant de plus que Fo ne de6pend que des Z et est indEpendant 
des Vi et que Fl1,F2,...Sont des fonctions pkriodiques de pkEriode 27 par 
Tapport aux 7/. (H. PoincarE 1892, p. 321.) 





Consider a Small perturbation of a completely integrable Hamiltonian. In action- 
angle variables (au,0) on 刀 x T4 (D an open subset of 玉 d, this takes the form 


瑟 (a,0) 了 五 o(a) 世 EDi(a， 0) ， (2.D) 


where s is a small parameter. We assume that 互 o and 有 are real-analytic, and that 
the perturbation 万 1 (which may depend also on s) is bounded by a constant on a 
complex neighbourhood of 九 x T4 that is independent of <. No other restriction 
Shall be imposed on the perturbation. 

For the unperturbed System (E 三 0) we have Seen that the motion is conditionally 
periodic on invariant tori {a = const,0 < T4}. Perturbation theory aims at an 
understanding of the fow of the perturbed System. The basic tools are Symplectic 
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coordinate transformations which take the System to a form that allows the long- 
time behaviour (perpetually, or over time scales large compared to <s-1) of solutions 
of the System (certain solutions, or all solutions with initial values in Some ball) to be 
Tead off. There are different transformations that provide answers to these problems. 
The emphasis in this section will be on the construction of sulitable transformations， 
not on the technical but equally important aspects of obtaining estimates for them. 

The methods in PoincarE's Melpodes Noxvelles form the now classical part of 
perturbation theory, but the theories of Birkhofft, Siegel, Kolmogorov/Arnold/Moser 
(AMI) and Nekhoroshev in the 20th century have become“classicsS” in their own 
Tight. 


X.2.1 The Basic Scheme of Classical Perturbation Theory 


In the Spirit of the preceding Section, one might search for a Symplectic change of 
coordinates (a, 0) 一 (b, p) close to the identity such that the perturbped Hamiltonian 
written in the new variables (D, P) depends only on 交 or more modestly, depends 
only on up to aremainder term of order O(sN) with alarge N > 1l, or to begin 
even more modestly, with N = 2. We Search for a generating function 


5( 六 0 二 0.0 二 5S1(0g 


where . symbolizes the Euclidean product of vectors in 下 4 and 5S1 is 2r-periodic 
in 0. Naively, we require that the symplectic transformation defined by 


be such that the order-s term in the expansion of the Hamiltonian in the new Vari- 
ables, 玉 (b PP) 三 五 (ao0), 开 (opP) = 万 0() 十 sK1(0 op) 十 .. .depends only on 0. 
Since 


站 O51 加 D591 
万 (a,0) = 瓦 (is 他 0)， 0) 一 万 0( 忆 十 E 人 . 页 出 0) 十 五 1(b， 站 二 
with the vector of frequencies 
OD0 
w(D) 二 -到 人 世 ， 
the function 91 must satisfy the partial differential equation 
D9 三 - 
wo 信人 + 而 他 凡 一 豆 ( C.2) 


for a function 万 that does not depend on 0. Since 91 is required to be 2Tr-periodic 
in 0, the function 束 1; must equal the average of 万 1 over the angles: 


击 疝 三 三 (9) dg. 
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Equation (2.2) ls the basic equation of Hamiltonian perturbation theory. From the 
Fourier series of 591 and 万; ， 


Si 人 = >》 sk)epe 而 (及 = >》 及 ( 们 ee 


KEZ4 KEZd 


we obtain a formal solution of (2.2) by comparing Fourier coefficients: so0(b is 
arbitrary and 
三 吕 庆 国人 
icw(D) 
Atthis point, however we are Struck by the pfoplema of szzzall deznzo11az12ators. For any 
values of the frequencies wj(b), the denominator 有 .w(b) = Aiwl( 世 十 .十 Kuwd(D) 
becomes arbitrarily small for some 天 一 (1 . .Ko) E Zd, and even vanishes ifthe 
frequencies are rationally dependent. 

For a perturbation where only finitely many Fourier coefficients AP are non- 
Zero, the construction above excludes only a finite number of resonant frequencies 
(ie., those with 有 .w( 人 = 0forak e Za with 和夫 0)andsmallneighbour- 
hoods around them. For w(b) outside these neighbourhoods and for P on a complex 
neighbourhood of Td, we obtain for the Hamiltonian in the new variables 


天 (D， 0) 一 万 0(D 十 < 万 1 (中 十 O(c5) 


sk(D 一 大 尖 0. (2.3) 


In the general case，we can approximate the perturbation 万 up to O(s2) by a 
trigonometric polynomial. For analytic 万 1i, the Fourier coefficients jdecay expo- 
nentially with | 丰 | = > ;| 尼 |, and hence the required degree 7 of the approximating 
trigonometric polynomial grows logarithmically with s, i.e., 7m 全 | log e|. 

As s 一 0, the remainder term is under control only for those frequencies 
w 一 w(b) for which the exponentially decaying Fourier coefficients Aiz of the pertur- 
bation decay faster than the denominators i . w with growing | 太 |. This is certainly 
the case for frequencies Satisfying Siegel1s dioppa1mtine co11dGiiO (OF S11ro118 1101- 
1eso11G11Ce CO1Gifio1, as it 1s Sometimes called) 


IEowl 关 7 KEZe 关 0 C.4) 


for some positive constants 7, Z. (Here again, | 有 | 二 > ;ji). 玉 7 > qd 一 1,thesetof 
frequencies in a fxed ball that do 1zof Satisfy (2.4) has Lebesgue measure bounded 
by Comst . ?7 (Exercise 9). Therefore, almost all frequencies Satisfy (2.4) for some 
7y > 0. However for any >y and v, the complementary set is open and dense in 了 及. 


X.2.2 Lindstedt--Poincare Series 


.… pour que la methode de M. Lindstedt Soit applicable，Ssoit Sous Sa 
forme primitive, soit sous celle que je lui ai ensuite donnke, il faut qu"en 
premiere approximation les moyens mouvements ne Soient liks par au- 
cune relation linkaire a coefficients entiers; .…. 


了 semble donc permis de conclure que les skries (…) ne convergent pas. 
Toutefois le raisonnement qui prEcede ne suffit pas pour etablir ce point 
avec une Tigueur complete. (H. PoincarE 1893, pp. vi, 103.) 
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Fig. 2.1. Henri Poincare (lefb, born: 29 April 1854 in Nancy (France), died: 17 July 1912 
in Paris; Anders Lindstedt (righb, born: 27 June 1854 in Sundborn (Sweden), died: 1939. 
Reproduced with permission of Bib1l. Math. Univ. Geneve 


The above construction ls extended without any additional difficulty to arbitrary 
finite order in s. The generating function 1s now Sought in the form 
3( 六 0) 一.0 二 Es591(00) 十 s252( 思 人 十 .十 ELSNw_1(00) (2.5) 


and， as before, the requirement that the first N terms in the s-expansion of the 
Hamiltonian in the new variables be independent of the angles, leads via a Taylor 





expansion of the Hamiltonian to equations of the form (2.2) for 51,... ,9N_1: 
O9; 上 
vc 南 十 本 信人 ) 一 天 7 人 (2.0) 
where Ki1 = 万 1， 
殊 二 1 D2 万 0 0O91 0O91 O 万 1 0O91 
2 2pb2\ 5 00) 5 6060， 


and in general A7y is asum of terms 





1 DDN Do9k) ON， 

: 多 th Ho 十 a 十 .十 太一 了. 
(2 SA 

The function 天 7 denotes again the angular average of 用 7). These equations can be 
formally Solved in the case of rationally independent frequencies. The Hamiltonian 
in the new variables is then 
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开 (bP) = 万 0( 人 十 E 丽 1( 人 二 s2 天 2( 人 十 .二 ET 一 I 开 ww_1( 人 + Rw( 人 00) (2.7) 


The possible convergence of the series for AN 一 oo ls a delicate issue that was 
not resolved conclusively by PoincarE (1893) in his chapter on “Divergence des 
SEries de M. Lindstedtf” 开 for Some 妨 ， the series (2.3) together with its partial 
derivatives converged as N 一 co, then { = 大,P E Td} would be an invariant 
torus of the perturbed Hamiltonian System. However it was not until olmogorov 
(1934) that the existence of invariant tori - for diophantine frequencies - was found， 
Using a different construction. A direct proof of the convergence of the Series of 
classical perturbation theory for diophantine frequencies was obtained only in 1988 
byEliasson (published in 1996); also see Giorgilli & Locatelli (1997) and references 
therein . 

Nevertheless, already the truncated Series (2.3) leads in a rather Simple way to 
Strong conclusions about the How over long time scales when it is combined with the 
idea of approximating the Hamiltonian by a trigonometric polynomial: the“ultra- 
Violet cut-off”, an ldea briefly addressed by PoincarE (1893), p. 98f., and taken to 
its full bearing by Arnold (1963) in his proof of the 天 AM theorem. We formulate a 
lemma for a fxed truncation index NV. Here, we N(I) denotes the derivative of the 
truncated series (2.7) with respect to 0. 


Lemma 2.1. SuPpPpose 妨 af w( 妨 ) satisjies 1pe diopjpantine coadition (2.4) For ay 
Jixed N 之 2, 1je1e are posifive co118ta11s so0, CC SUCH 1Pat 1Pe jlowing 1olads or 
E 近 co:1jpereexistya7eal-a1zalytic sy1iplectic cna1age ofcoojdipatey (a,0) 一 (DOD) 
SUC1 1Patf every SO1Uion (b 蚊 2) oftpe pertrped syste 大 玉碎 e em coordinatesy， 
S1G11i128 Wi | (0) 一 姑 | < cllog sl 一 1 satisjies 


2 人 一 OO<Ctey Jor t<sT+l， 
|e 的 一 wew(O))E 一 2(0) EC( 妇 十 直 loges| 呈 sw Jr 妇 和 es 全 


Moreoyver tpe tansfpomaation 1 O(E)-close io tpe idenfip (oa0) 一 (op < Ce 
jpolds jpor(a;,0) and (bpP) 7elated by te above coo1dipnate 1ra1ts1jomzza 加 or 一 六 | 忒 
cllogs|l-“ 一 1 andjJjorepzimnans-independent complexr teigjpbpoxrpood orT4. 

77je co1msia11s so;cC depe1d on Nd7Z ad on potadsy of Do and 1 om 4 
comzDIex 7neigjhpoxrpood of { 姑 } x T4. 


Prooj Using the relations (2.3) and their analogues for (2.6), it is a straightforward 
but somewhat tedious exercise to Show that at the given particular 妨 , the functions 
开拓 Si( 姑 ) are all analytic on the same complex neighbourhood of Te, and 
that the remainder term is bounded by 


RN(O 0 EC=CUN TI) 


for all 0 in a complex neighbourhood of TY which is independent of <. Here，C 
depends in addition on the bound of 瑟 ; on a complex neighbourhood of {zz} x Tv， 
or what amounts to the Same by Cauchy's estimates, on bounds of the exponential 
decay of the Fourier coefficients PK of 妃 1. (In case of doubt, see also 9ect.X.4 for 
explicit estimates.) 
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Assume first that 五 1(b 0) is atrigonometric polynomial in 0 of degree 7. Then 
天 ) 5 are trigonometric polynomials of degree 77m. Since | 有 .w(O)| > | 大 ww( 妨 )| 一 
Emmax lw 一 巡 |thereisa6 > 0such that 


和 (全 | > 幸 因 -for 上 一 下 | < 5 <Nm. 


This number 6 is proportional to y(Vmz)-“ 一 !. Consequently, since the construction 
involves only the trigonometric polynomials 玉 )， 97 of degree up to Nm the above 
estimate for the remainder term 刃 w holds also for | 一 巡 | 么 06. To approximate 
a general analytic 万 ; by trigonometric polynomials up to O(ev ), we must choose 
the degree 7m0 proportional to |log sX|. With the choice 6 = c(N?llogel)- 一 ! 

for a suffticiently small c > 0 independent of s (and Wj, the above bound for the 
remainder 民 N(b, 0) is then valid for p in the complex ball | 一 巡 | 和 20 and for 
oO in a complex neighbourhood of Td (which depends only on NV). By Cauchy's 
estimates, this implies 


for | 一 大 | < 6andb0 ETe4. Hence, aslongas | 人 一 天 | < 0,the Hamiltonian 
differential equations are of the form 


DR LN DORN 00 
op 0 ap 


This implies the result. 


,6 
二 





=wev(I) 十 O(ev7160) . 














Hence, the tori { = 80)，P E TY are nearly invariant over a time scale 
EN+l, and the fow is close to a quasiperiodic fow over times bounded by the 
square Toot of s-w+1. Lemma 2.1 is just a preliminary to more substantial results 
(which hold under appropriate additional conditions): invariant tori carrying a quasi- 
periodic fow with diophantine frequencies persist under small Hamiltonian pertur- 
bations (Kolmogorov 1954); every Solution of the perturbed System Temains close， 
within a positive power of s, to Some torus over times that are exponentially long in 
anegative power of s (Nekhoroshev 1977); Solutions starting close to an invariant 
torus with diophantine frequencies Stay within twice the initial distance over time 
intervals that are exponentially long in a negative power of the distance (Perry 祥 
Wiggins 1994) or even exponentially long in the exponential of the inverse of the 
distance (Morbidelli & Giorgilli 1993). 

The symplectic transformations of this Subsection were constructed using the 
mixed-variable generating function 9(2, 0). As was pointed out for example by 
Benettin，Galgani 文 Qiorgilli (1985), rigorous estimates for the remainder terms 
are often obtained in a simpler way using the Zie 1etpod, which involves construct- 
ing the near-identity Symplectic transformation as the time-s fow of some auxiliary 
Hamiltonian system with a suitably deftined Hamiltonian X(b, 2). As before, the 
condition that the Hamiltonian 五 (a,0) = 天 (bp) should depend on p only in 
higher-order terms, leads to equations of the form (2.2), now for X instead of 591. 
We will use Such a construction in the following Subsection. 
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X.2.3 下 olmogorov's Iteration 


It is easy to grasp the meaning of Theorem 1 for mechanics. It indicates 
that an s-parametric family of conditionally periodic motions [.…] cannot， 
under conditions (3) and (4) [here: (2.4) and (2.9)], disappear as a result 
of a small change in the Hamilton function 万 . 


In this note we confine ourselves to the construction of the transforma- 
tion. (A.N. Kolmogorov 1954) 


For the completely integrable Hamiltonian 五 o(a), the phase space is foliated into 
invariant tori parametrized by w. We now fix one such torus fa = ax*，0 ec T4} with 
Strongly diophantine frequencies w = w(a”). Without loss of generality, we may as- 
Sume au” 三 0. This particular torus ls invariant under the fow of every 了 Hamiltonian 
妃 (a,0) for which the linear terms in the Taylor expansion with respect to a at 0 are 
independent of 0: 

万 (awg) =c+w:a+gazM(aob)a (2.8) 


with c E 月, w E 下 4 and a real symmetric dg x dmatrix M(a,0) analytic in its 
arguments. Since the Hamiltonian equations are of the form 


a=O(lal，0=w+O(lal)， 


the torus {a = 0，0 < Tdl is invariant and the fow on it is quasi-periodic with 
fredquencies w. 

Considernow an analytic perturbation of such a Hamiltonian: 束 (a,0) 二 sG(a,O) 
with a Small =. Kolmogorov (1954) found a near-identity Symplectic transforma- 
tion (oa, 0) 上 (z 骨 , constructed by an iterative procedure, such that the perturbed 
Hamiltonian in the new variables is again of the form (2.8) with the Same w, and 
hence has the invariant torus {a = 0， b e Te} carrying a quasi-periodic flow with 
the frequencies of the unperturbed System. This holds under the conditions that w 
satisfies the diophantine condition (2.4), and that the angular average 


1 
AM10 :一 [5 汪 1I(0,0)ad0 is an invertible matrix. (2.9) 
Here we describe the iterative construction of this Symplectic transformation. The 
proof of convergence of the iteration will be given in Sect. 和 .5. 
We construct a Symplectic transformation (a, 0) 上 (bp) as the time-s fow of 
an auxiliary Hamiltonian of the form 


Q 
X(2O) = 和 2 十 Xo(P) 十 》 0Xi(2)， (2.10) 
了 


where & E 了 2 is a constant vector, and Xo,X1,...,Xau are 27r-periodic functions. 

(Quadratic and higher-order terms in p play no role in the construction and are there- 

fore omitted right at the outset.) The old and new coordinates are then related by 
OX 


a=bp+e 基 0)+TOG )， =p 一 < 二 (信人 十 O(s2) . 
2 
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We insert thlis into 


瓦 (ab) 二 esG(a0) =c 二 ww 十 3OM(P)b 
DX OX 


+ . 二 全 (DO) 下 5 


OO+G@ +oclam+oc9， 
2 


We now require that the term in curly brackets be Comnst 十 O(| 串 2. Writing down 
the Taylor expansion 


地 
G(O=Go( 六 + >》 5Gi( 人 十 虹 Q(P)P C.1D) 
并 


and inserting the above ansatz for X, this condition becomes 





人 Er 人 
oO+a) 


d 
十 Go(P) 十 >》， jiGi(PD) = Comst.， 
记 1 





whereu = (ud anduy = (ui ,ua)7 are defined by 
U(O) = MI(00P)6， (2.12) 
0 
oO) = M(0,P) 本 2(p) C.13) 
W 
The condition ls fulfilled 1 
本 
w+Gog) = 可 C.14) 
DOxi 3 
2 人 TO)+GiO) 一 而 + 下 + 0.19) 
元 十 元 十 G =0 (一 1 .gd). (2.10) 


Here the bars again denote angular averages. Note that equations (2.14), (2.13) are 
of the form (2.2). Equation (2.14) determines Xo and hence v 一 (1 .ud)7 by 
(2.13). Equations (2.16) then give 元 = (zzo)7. By (2.12), we need 


五 = Mo6 ， 


which determines 上 《 uniquely because Mo is assumed to be invertible. Equation 
(2.12) then yields w = (ud). Finally, (2.15) determines X1,...，Xa, and 
the construction of X(b, 2) is complete. In the new variables (b, 2), the perturbed 
也 amiltonian then takes the form 


万 (wb)+sG(wg=E+w:D+3HOIMOPD+E2G(OP) (2.17) 
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with unchanged frequencies w and with (六 pp) = AMP) 十 O(e). The pertur- 
bation to the form (2.8) is thus reduced from O(s) to O(e). The iteration of this 
procedure turns out to be convergent, See Sect.X.3. This finally yields a Symplectic 
change of coordinates that transforms the perturbed Hamiltonian to the form (2.8). 
The perturbed System thus has an invariant torus carrying a quasi-periodic fow with 
frequencies w -a 开 AM torus, as it is named after Kolmogorov, Arnold and Moser. 


X.2.4 Birkhoff Normalization Near an Invariant Torus 


开 AM tori are very sticky. 
(A.D. Perry 人 放 S$. Wiggins 1994) 


In this Subsection we describe atransformation studied by Poschel (1993) and Perry 
贸 Wiggins (1994) for Systems with Hamiltonian in the Kolmogorov form (2.8) in a 
neighbourhood of the invariant torus {a = 0，0 ec T4d}. This transformation is an 
analogue of a transformation of Birkhoff (1927) for Hamiltonian Systems near an 
elliptic stationary point. 

The symplectic change of coordinates (a;,0) 一 (bp) considered here trans- 
forms a Hamiltonian (2.8) with diophantine frequencies w to the form 互 (co,0) = 
玫 N( 人 (十 O(i>) for arbitrary V, or more precisely, the Hamiltonian in the new 
variables, 妃 w(b, Pp) 三 五 (o0),is ofthe form 


瑟 vtoO) 三 w 0TTGv) 二 RN OP) (2.18) 


with GZv(b) = O( 人 2) and 已 N(oP) = @O( 加 六 ). CWe have taken the irrelevant 
constant term in (2.8) c = 0.) The equations of motion then take the form 


= O 人 >) ， 号 =w+O() 


Therefore, in these variables { = 0,P E T4} ls an invariant torus, and for suff- 
ciently Small 7， 


(0)<r implies | < 27r fort< Cr +l. 


Ajudicious choice of N even yields time intervals that are exponentially long in a 
negative power of 7 on which solutions starting at a distance 7 stay within twice the 
initial distance (Perry 儿 Wiggins 1994). Motion away from the torus can thus be 
only very Slow. 

The normal form (2.18) is constructed iteratively. Each iteration Step 1S Very 
Similar to the procedure in Sect.X.2.1, where now the distance to the torus plays the 
role of the small parameter. Consider a Hamiltonian 


万 (aw 人 =w:a+2G(a) 十 民 (a0) 


where Ga) = O(al2) and Raw0) = O(al5 for some 大 > 2 in a com- 
plex neighbourhood of {0} x T4. We construct a symplectic change of coordinates 
(a,0) 上 (0 op) via a generating function p .0 十 9(0) as 
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09 O9 


We expand (omitting the arguments (2, 0) in 93/60 and 9 万 /0a) 


9 9 万 059 
厅 (2+ 0 二 作风 二 二 二 亿 必 人 
9 万 059 
一 0420+10O+ 呈 责 }owg， 


where |&(, 0 和 Const. | 85/60l2. Since 9 有 /88 = w 二 O( 人 (we can make 
the expression in curly brackets independent of 0 up to @O(| 咱 人 +1) by determining 
oO from the equation of the form (2.2): 


D9 


ww 而 敢 人 十 RD 人 一 瓦 人 


For diophantine frequencies w, we obtain S(20) = O( 串 呈 on a Ceduced) com- 
plex neighbourhood of {0} xT4 from the corresponding estimate for 忆 (5 0). Itfol- 
lows that the above symplectic transformation with generating function 0.0 十 9(D, 0 
is well-defined for small || 咱 ,and the Hamiltonian in the new variables, 冰 (b, o) 一 
刀 (a,0), becomes 


一 


硝 (0 p) = 一 ww.D 十 2 人 十 天 (0 





09 -由 区 0+QC9= Ole ， 


So that the order in b of the remainder term is augmented by 1. The procedure can 
be iterated, but unlike the iteration of the preceding Subsection, this iteration is in 
general divergent. Nevertheless, a Suitable finite termination yields remainder terms 
that are exponentially small in a positive power of r for ||o| 和 ,by arguments 
Similar to those of Sect. 和 .4. 


X.3 Linear Error Grovwth and Near-Preservation 
of First Integrals 


In theremaining part of this chapter we Study the long-time behaviour of symplectic 
discretizations of integrable and near-integrable Hamiltonian Systems. While here 
we will be concerned with general Symplectic methods, it Should be noted that Some 
integrable problems admit integrable discretizations; See Suris (2003). 

In this section we are concerned with the error growth of symplectic numerical 
methods and their approximate preservation of first integrals. A preliminary analySsis 
of linear error growth for the 玉 epler problem was first given by Calvo 儿 Sanz-9erna 
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(1993). Using backward error analysis and 上 AM theory, Calvo 儿 Hairer (1995al) 
then Showed linear error growth of Symplectic methods applied to integrable SyS- 
tems when the frequencies at the initial value satisfy a diophantine condition (2.4). 
Here we give Such a result under milder conditions on the initial values, combining 
backward error analysis and Lemma 2.1. We derive also a first result on the long- 
time near-preservation of all first integrals, which will be extended to exponentially 
long times in Sections X.4.3 and X.3.2 (under Stronger asSumptions on the starting 
values), and perpetually in Sect. 和 .6 (only for a Cantor Set of step Sizes). 

Figure 3.1 illustrates the linear error growth of the Symplectic Stormer-Verlet 
method,， as opposed to the quadratic error growth for the classical fourth-order 
Runge-Kutta method, on the example ofthe Iodalattice. The Same number of func- 
tion evaluations was used for both methods. 








[nl ITTTTTI 
.8 六 global error | | 
6F RK4, 疡 一 0.08 
.4 上 
5 「 Stormer/Verlet 刀 = 0.02 
| AAA | 1 1 1 1 | 
50 100 








Fig. 3.1. Euclidean norm of the global error for the Stormer-Verlet Scheme (Step Size 几 三 
0.02) and the classical Runge-Kutta method of order 4 (Step Size 玉 = 0.08) applied to the 
Toda lattice with 即 =3 and initial values as in Fig.1.3 


We consider a completely integrable Hamiltonian System (usually not given in 
action-angle variables) 


8 8 厅 
有 0 (3.1) 


and apply to it asymplectic numerical method with step Size 几 , yielding a numerical 
Solution sequence (pn, qn). We assume that the Hamiltonian is real-analytic and that 
the conditions ofthe Arnold-Liouville theorem, Theorem 1.6, are fulftilled. Consider 
the symplectic transformation (P, dg) = W(a,0) to action-angle variables. We denote 
the inverse transformation as 


(0) = (7(p,q),9(p,9)) ， (3.2) 


Werecall that the components 1, ,7of7 = () arefirstintegrals ofthe system: 
TD 人 (bd 的 ) = To,ao) for al 上 In the action-angle variables, the Hamiltonian is 
Mt(a) = 万 (D,9qj, and we denote the frequencies 


w(a) 三 Co (3.3) 


We consider this in a neighbourhood of some a* E 下 4. 
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Theorem 3.1，Co7zsider apPPL1yig da 8Sy11DPlectic 11U111erical integ1ator of order D 1o 
1 太 pe co1iD1etely integraple 已 aailiomia1a syste1a (3.7) SUPPpose tpat w(a”)] Satisjies 
1pPe diopjpa1tiaze co1adifioz (2. 引 7T1e1， 加 eye exXist Positive co11sta11 Cc ad Po 
SUCHP 1Pat 1je jiowing Holds jpor Ga1 step Sizey 凡 芭 R0: eyery 11U111e1ical SOUUiO7 
S1G11i18 Wizp 7T(po, go) 一 ax < cllog P 一 satisjies 


CEAP 
Cj 


| gm) 一 (Pd(b) 


Jor tl=7 几 区 几 2 . 
|7(pn on) 一 了 (po, dao) 


| < 
| < 
771e co1sta11s 六 o, c, C depe1d o1l gd;,T, DZ oO11 Dou1lds oftpe eal-a11al1ytic 瑟 a111izl1omia7 


瑟 om a co1miplex 1eig1ppoxrjpood or tpe ioruds {(p,dq);iTD9) = oand on tpe 
71U11IE7ICCL 11e1 妨 od 


Poof (a) In the action-angle variables (a,0), the exact How is given as 
a 四 三 a(0)，0 的 三 w(a(0))t 十 000) . (3.4) 


By Theorem IX.3.1 (and Iheorem IX.1.2), the truncated modified equation of the 
numerical method is Hamiltonian withl 


瓦 (p;,9) = 瑟 (D,9) 十 好 古 1(D9) 十 十 太 厅 Hi(D,9) 


We choose7 = 2p, and we denote by (D( 人 ,5 坟 ) the solution of the modified equa- 
tions with initial values (2zo, do). In the variables (oa,0), the modified Hamiltonian 
becomes 刀 (p,9) = Mt(a,0) with 


Tt(a,0) 三 Ta) 十 s9h(a;0) ， (3.5) 


where s = AM and the perturbation function Gn 18 bounded independently of on 
a complex neighbourhood of {a*} x Td. By Lemma 2.1 withe = hp and NV > 3， 
there is a Symplectic change of coordinates @O(AP)-close to the identity, such that 
the solution of the modifed equation in the new variables (b, p) is of the form 


汪 for 上 < 几 2， (3.6 
hn(b(O))E 二 CO0) 十 OP2N-L 十 友 PhPN) 本 人 


5 
with wzn(b) = w() 十 O(AP). The constants symbolized by the O-notation are in- 
dependent of 凡 oft < 2 and of (8(0),5(0)) with |8(0) 一 ox| < cllog 则 一 一 上 
Since the transformation between the variables (a,0) and (bp) is O(AP) close to 


the identity, it follows that the fow of the modified equations in the variables (a,O) 
Satisfies 


8 = 20) 上 OPPN) ， 


1 We always assume, without further mention, that the modified Hamiltonian is well-defined 
on the Same open set D as the original Hamiltonian. This is true for arbitrary Symplectic 
methods 让 Dis simply connected; on general domains it is satisfied for (partitioned) 
Runge-Kutta methods and for splitting methods; see Sections IX.3 and IX.4. 


416 X. Hamiltonian Perturbation Theory and Symplectic Integrators 


二 全 否 | for 1 世上 用 几 2 ， 
6 =w(z(0)tT60) 十 ten 十 O(P) 


where er = whn(b(0)) 一 w(a(0)) = O(P) yields the dominant contribution to the 
error. By comparison with (3.4) and since wa( 坟 = 了 TD 坟 ,9 的 ), the difference be- 
tween the exact solution and the Solution of the modified equation therefore Satisfies 


(bb) 一 (Pd 的 ) 一 OP) 
TU 的 ) 一 TUPo,go) = OU) 
The same bounds for 上 和 1 follow by standard error estimates. 
(b) It remains to bound the difference between the solution of the modified 


equation and the numerical solution. By construction of the modifed equation with 
7=2D and by comparison with (3.6), one Step of the method is of the form 


pn+1 三 bn 十 O(PT) ) n+ 二 1 二 wh(bn) 丸 十 pn 十 O(P+T). 
It follows that for 上 一 mA， 
太一 0 鸭 上 OU ，o= 攻 四 二 O(2j) 


Fort 艾 几 -2and7 三 2D, wehavethr 芝 /1.Hencethe difference between the nu- 
merical solution and the solution of the modified equations in the original variables 
(p,dq) is bounded by 


(nm gm) 一 CC) 一 OU) 
TCDm ga) 一 TS 一 OU 
Together with the bound of part (a) this gives the result. 


for 1 区 志 几 2. 


for 上 一 7 区 2. 














Remark 3.2. The linear error growth holds also when the Symplectic method is 
applied to a perturbed integrable System with a perturbation parameter s bounded 
by a positive power of the step Size: E 冬 玉 Pa for Some Qw > 0. The proof of this 
generalization ls the Same as above, except that posSsibly a larger N is required in 
using Lemma 2.1. 


上 xample 3.3 (Linear Error Growth for the Kepler Problem). From Exam- 
ple 1.10 we know that for the 玉 epler problem the frequencies (1.16) do not Sat- 
isfy the diophantine condition (2.4). Nevertheless we observed a linear error growth 
for symplectic methods in the experiments of Fig.I.2.3 (See also Table [2.1). This 
can be explained as follows: in action-angle variables the Hamiltonian of the 天 e- 
pler problem is (al,az)，where az = 了 工 is the angular momentum. Since the 
angular momentum is a quadratic invariant that ls exactly conserved by Symplec- 
tic integrators Such as Symplectic partitioned Runge-Kutta methods, the modiftied 
也 amiltonian 辆 
Tt(a， 0) 一 Tt(al， a2) 十 EGhn(al， Q2， 01) 

does not depend on the angle variable 02 (See Corollary IX.3.3). As in the proof of 
Lemma 2.1 we average out the angle 01 up to a certain power of s. Since we are 
Concerned here with one degree of freedom, the diophantine condition is trivially 
Satisfied, and we can conclude as in Theorem 3.1. 
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和 X.4 Near-Invariant TIori on 上 Exponentially Long Times 


We refine the results for the classical perturbation Series of Sect.X.2.2 to yield lo- 
cally integrable behaviour, up to exponentially Small deviations, over time intervals 
that are exponentially long in a power of the Small perturbation parameter. We then 
combine this result with backward error analysis to Show the near-preservation of 
invariant tori over exponentially long times in a negative power of the Step Size for 
Symplectic integrators. We begin with the necesSary technical estimates. 


X.4.1 Estimates of Perturbation Series 


We will estimate the coefficients of the perturbation Series (2.3)， which requires a 
bound for the Solution of (2.60). We use the following notation: for p > 0 and with 
| |the maximum norm on 了 及 9， 


到 ={0osTe+ 唐 4; mobl< 人 


denotes the complex extension of the qd-dimensional torus Td of width 0p. For a 
bounded analytic function 六 on Co we write 


= 邮 EO， | 人 -三 必 |， 


FEollowing Arnold (1963)，we Prove the following bounds for the solution of the 
basic partial differential equation (2.2) . 


Lemma 4.1.Sxppose w E 陈 d safisjies 1Pe Giopjpa1a1ire comdifion (2. 了 LEIG Dea 
Dox1ided 7eal-a1a1yfic Jazzction on Do ad ef G denote 加 e average of G over 氏 呈 


7je1 1pe eqguatio7m 
OP 


jas a LU1iGUe 1eal-a11a1ytic SOLUtion 忆 on Up Wii ze1o ave1age 五 =0. Forevemy 
Positive 0 < min(p,1), 已 ipoded on Up 5 py 


+G=CG 


二 DF 
Ilasmoeolcl， | 全 








< po “|Gcl，， 
D 一 0 


WwWjpereaw 一 zz 十 dg 二 1andpo 一 71842x 风 Al 二 71842xf1( 十 1)L 


Russmann (1973, 1976) has shown that the estimates hold with the optimal ex- 
ponent a 一 十 1and with ro = 24+1“V(2z)land ri 一 24 V(27 十 2)L This 
optimal value of ac would yield slightly more favourable estimates in the following， 
but here we content ourselves with the Simpler result given above. 
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Pioofof ZLeaz1ia 47， We have the Fourier series, convergent on the complex exten- 
sion ||Imoll < p， 





大 = 此 for KEZad 大 0. 


证: 


By Cauchy's estimates, |op| < Merlaoe withM = 1G-GI < 2G and 
1 旭 | 三 2 | 好 .Ifollows with (2.4) that 


AI 
Res 和 >》 | 大 |eNe-5 < | 有 ells ， 
大 


OP 
O0 











5 0 
| 
天 天 


It remains to bound the right-hand sums. We use the inequality z” /zw < ez with 
2 三 | 旭 |5/2 to obtain 


亲人 es< 2r5-o ea/2， 
达 天 


The last sum is bounded by 
|K|572 SN 0 5-1d 
一 1 2 2 ) 一 一 -一 < 8 
人 汪 4 | CE 】 站 人 ) 


Taken together, the above inequalities yield the stated bound for || 王 || 鳃 5. The bound 
for the derivative ls obtained in the Same way, with > replaced by 2 十 工 . 














The coefBcients of the perturbation series (2.9) are bounded as follows. 


Lemma 4.2. Zet 万 0, 万 1 pe real-amnalyfic da12d pot1aded py AM o7 tpe co1aPplex 7- 
1eighppbou1jpood r( 巡 ) of 及 E 月 4 and on 万 ( 姑 ) x Up，respectively Suppose 
1pat w( 妨 ) = (950o/0a)( 妨 ) satisjies 1pe diopjpantine comndiftion (2. 和 7Tje，1ne 
coe 太 cie11s of pe perttrpatio1l Se1iey (2.5) da1e Dox1aded Dy 


< Co(C17o)7 
D/2 


5) 
一 (0 
| 爱 ， 中 








Jporal7yT > 0. Pere co = 2m and CC = 128(k1AMJ/rpoa)2 wii a and Fi oF 
Ze11110 和 了. 
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Poof We recall from Sect.X.2.2 that 9j is determined by (2.0), where K1 = 万: 
andfor7 > 2， 





: 过 105F0 85 85 
本 ou: 00 ” ”00 
1 一 2 j1 十 .十 帮 ii 一 


18B /89 65， 
人 有 ( 0 】 


i1 Ki 十 .十 太一 7 一 1 





We fx an index, say .J, set 6 = D/(2.7) and abbreviate 


| 到 二 KR， 镍 o-75 


and similarly for 99/00. By (2.6) and Lemma 4.1, we have 
| 过 | me- 


We use the Cauchy estimate 











iL Dai (OUi) 么 南 | 加 ol， 
where | . | denotes the sum norm on Cd, and bound || . | by sfor 开 < 了 一 1 


We thus obtain from the above formula for 及 7 


5 
上 
一 太一 了 
和 1 
0 


1 一 1 十 .十 有 一 7 一 1 


95 5 


























1 1 


25 | 53 


























1 


Combining the two bounds yields 








画 | 有 的 


where, with 风 = (M/r)(k1/6“), we have Di = 人 andrecursively for7 > 2， 


7 全] 
太 =A>， >》， DB 二 AN 》， >》 Di 


i 一 2 有 十 .十 语 二 了 5 一 1 ja 十 .十 1 二 和 1 


Multiplying this equation with C7 and summing over 7 we see that the generating 
function 0(C) 王 带 疡 1 DiG7 is given implicitly by 


x9-K=( -50)+K( 语 -0 
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or explicitly, after solving the quadratic equation, by 


证 工 2 多 
b(C) = 六 一 ) 
(9) 2 1 十 1 用 工 十 以 
Hence, bg(G) is analytic on the disc |6| < 17/(4U(L 十 内)，and is there bounded by 
1/(2(1 十 J). For 人 二 1, Cauchy's estimate yields 


18sj/6bgly < rp < 2r (8 和 


(Forthe uninteresting case 人 < 1 thebound is 27.87-1.) For7 = Jthis almost gives 
the stated result upon inserting the definition of /, but with an exponent 2a instead 
of a. This can be reduced to aw 让 in the above proof gb is chosen as 1 = D/4in the 
first step and in the other steps as 0j 三 D/(4J). This leads to a more complicated 
quadratic equation where now 8(G) is analytic for |C| < (Ciyo)-1. We omit the 
details of this refnement of the proof. 




















For the remainder term in (2.7) we then obtain the following bound. 


Lemma 4.3. 7 1je Situation of ZLe1a1104.2 mW 职 7 和 1andJjorCiN<1/(2e)， 
4C1 re 
IlRw(e loas4Mr( -人 Ne) ， 


Proof The remainder term 忆 N in (2.7) ls asum of terms 





1 8 万 
二 部 (QQp) for 加 十 后 十 .十 大 = TV， 
Where 
区 总 OSK Oo9k+1l 如 TeN-k-193w-1 








0 矶 
Aslong as Cl V2 < 1/(2sj, we have, by Lemma 4.2， 


N=-1 
上 有 本 六: 且 0 CoCo 
了 一 有 
人 小 QI 
< Cn 冯 2-0- (六 ) (CNeJz < 2Co (CNe)z ， 

了 7 一 

This implies 

1 Du 





人 好 CNVJINV 
Oo 川 大 均 2Co(CN”) 














P/2 
for kFo 十 fi 十 .十 应 =W. CThis boundis also valid when an argument different 
from 上 appears in the derivatives of 瓦 0 and 有 Fi, as is needed for the remainder 
termas in the Jaylor exzpansion.) Estimating the number of such expressions by 


JV 2N 一 1 
NT+i-1l 2N -1N ov 
二 ?一 0 


yields the result. 
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X.4.2 Near-Invariant Tori of Perturbed Integrable Systems 


The following result extends Lemma 2.1 to exponentially long times for sufficiently 
small values of the perturbation parameterT. 


Theorem 4.4.， Zer 万 o, i; pe eal-anal1ytic on tpe comiplexr7r-11eigHhpoxrpood 克 ( 扩 ) 
of 及 E 陈 4a1nd on 奋 r( 巡 ) x Drespectiyey wiihir 芝 1andp 芭 1. Suppose tat 
w( 扩 ) =(950/0a)( 妨 ) satisjies ipe dioppantine condifion (2. 基 7Tpere are Positive 
CO11S1G1218 50，C0，C SUCH 1at 1jpe jjowing jpolds jor eyery Positive 0 < 1 and Jor 
E 近 co:1jpeeexisty aeal-a1zalytic sy1iplectic cpa1age ofcoojdizpatey (oa 0) 一 (DOD) 
SUCP 1atf every SO1U1ion (人 2) oftpe pertrped syste 太太 eew coo1dinnatey， 
S1G11i128 Wi |0(0) 一 天 | < cos22, satisjies 


12 一 0 < Ctexp(-cs-2) Jor texp(gcs 2) ， 


Perea =z7+d+landc= (16Cie/r)-Ua wii Ci ofrLenana42. Moyeover 
1pe 1ra1zg1jp7Paaa1ion 六 SUCR ta Jr(a;0) aad (bpP) 7elated py tpe abpove coo1dizzate 
17C11s1O7711， 


le- 如 和 Ce 帮 r 有 一 姑 |< cos22 pe 


D 


7Tje 1p1reshnolds so d1d co a1e SUCcHP 1Pat 计 1 iversely proportional io yYCP and co 


ipPJoportiomal 1o TOCY. 


Remark 4.S. Theorem 4.4 is a local result, showing that for bo near 久 the tori 
也 = 00，P ETedl are nearly invariant, up to exponentially small deviations, over 
exponentially long times. Nekhoroshev (1977, 1979) has Shown the 8/opal result， 
under a “steepness condition”which is in particular Satisfied for convex Hamiltoni- 
ans, that for suffticiently Small s evey Solution of the perturbed Hamiltonian System 
Satisfies, for Some positive constants 4, 已 < 1 (proportional to the inverse of the 
Square of the dimension)， 


la 一 a(o)l< se for t<exp(s 人 . 


了 emark 4.6. The constant C1 in Lemma 4.2 and constants in Similar estimates of 
Hamiltonian perturbation theory are very large, with the consequence that the results 
on the long-time behaviour derived from them are meaningful, in a rigorous Sense， 
only for extremely Small values of the perturbation parameter =s. Nevertheless, apart 
from their pure theoretical interest these results are of value as they describe the 
behaviour to be expected 夺 one presupposes that the constants obtained from the 
Worst-case estimations are unduly pessimistic for a given problem, as ls typically 
the case. 


Proofof7jpeore 44 The proof combines Lemmas 4.2 and 4.3 with the proof of 
Lemma 2.1. An appropriate choice of the truncation indices N and 7m then gives the 
exponential estimates. 


422 X. Hamiltonian Perturbation Theory and Symplectic Integrators 


Asinthe proofofLemma 2.1, we approximate 万 1(b,0) by atrigonometric poly- 
nomial of order 7n in 0. The error of this approximation is bounded by @( emP/ 2) 
on 互 -( 巡 ) x Upoja, which is O(e- >) for the choice m = 2NVp made below. By 
the arguments of the proof of Lemma 2.1, the estimates of Lemmas 4.2 and 4.3 (for 
? replaced by 7/2, which increases Ci to 4C1) are then valid in O((77m)-“) and 
O((N7rnz)-“) neighbourhoods of 因 : for a sufficiently small constant c” and with 
Co2 = 16 C1 /T， 


D97 .CA 了 7 一 炒 束 / 一 QQ 
| 受 w 中 < Co(4C172)7 for | 一 巡 ||< ch) 0Oe Zoo， 
IRw( 人 lb< 4Mr(CaNe) > for 用 一 姓 | < cv ge 
We now consider the symplectic change of variables (a,0) 一 (2 p) defined by the 


generating function 9(b, 人 . The Hamiltonian equations in the variables (0 2) are 
then of the form, for | 一刀 | <c(Vr) 


二 
Op 0O0 0p 
(4.D 
OF 


= 及 O) 一 weN( 人 TO(Nm "ec (CNo))， 
Choosing 网 =2N/pand N suchthat Co2Va = 1/(esp) gives 


D = O(exp(-ccs-2/o)) 


for | 一 大 | < cos22 (4.2 
六 = weN( 人 二 O(e- -202exp(-cs-p/a)) | 1 和 人 














with c = (Ce)-%, which yields the result. 


X.4.3 Near-Invariant Tori of Symplectic Integrators 


We return to the Situation of Sect.X.3 and apply a Symplectic numerical method to 
the integrable Hamiltonian System (3.1) with (3.2) and (3.3). 


Theorem 4.7. Co7zsider GaPPL1yig Qa Sy11Dlectic 11ierical in1egfator of ojder 7 
1O 1je 7eal-a1alytic co1iD1etely ;171eg1GD1e 瑟 a111zlio11ia1 SySte1a (3.7) UDPose 1Pat 
w(ax)] Satisjies ipe diopjpanatiae co11dition (2. 和 7Tjem tere exist Positive co11S1a1aty 
co CC a1d Po SUcP 1Pat 加 e Jo1iowmwing Hpolds Jor all step 8izey 几 去 10 ad Jor 
da1 1 生 min(pa) w 柬 a 三 了 十 Q@ 十 1 every 11ie1ical So1Utio1 SIG1108 Wi 
|7(po ao) 一 oz < cop2n satisjies 


Im) -Todol<Ci Jor 故 <exp(cj wo) 


7T71e comsta11s Ao, co,c;C depe1d on qd,T7 on pox1ads or ipe real-azaalytic 瑟 a1za11- 
io1lia1 如 om ad co1aplex 11eigjhporyjpood ortpe iors {(D,9q); TD,d) = oo aad om 
态 e 7111e1icaL 111e 态 od. 
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Prooj The proof is obtained by following the arguments of the proof of Theo- 
rem 3.1. Instead of Lemma 2.1, now Theorem 4.4 is applied to the modified Hamil- 
tonian System (3.3) with s = 2. This gives a change of coordinates (oa, 0) 一 (b O) 


O(12)-close to the identity, such that in the new variables, the solution (1( 力 ;5 人 人) 
of (3.9) satisfies 


8 所 = 加 TO(exp(-cpwWo)) for t<exp(ecjw/o . 


Onthe otherhand, using the exponentially small bound of Theorem IX.7.60, together 
with Theorem 4.4 and the arguments of part (b) of the proof of Theorem 3.1, yields 
for the numerical Solution in the new variables 


如 二 9 十 O(exp(-ch wo)) for 上 = PS exp(ch wa) . 














Together with ao 一 如 王 O(P) this gives the result， 


Remark 4.8. When the symplectic method is applied to a perturbed integrable SyS- 
tem as in Theorem 4.4, then the same argument yields for ||7(po,ao) 一 o| < co720 
with7 = max(s,M2) and 和 <1thebound 


Ian) 一 Togo 咱 <C7 for ti<exp(cmTPo) ， 


X.S 下 oljmogorov's Theorem on Invyariant Tori 


(The proof of this theorem was published in Dokl. Akad. Nauk SSSR 98 
(1954), S27-330 [MR 16, 924], but the convergence discussion does not 
Seem convincing to the reviewer.) This very interesting theorem would 
imply that for an analytic canonical system which is close to an integrable 
one, all solutions but a set of Small measure lie on invariant tori. 

(J. Moser 1959) 


It was a celebrated discovery by Kolmogorov (1954) that invariant tori carrying a 
conditionally periodic fow with diophantine frequencies persist under Small pertur- 
bations of the Hamiltonian. Together with the extenslions and refinements by Arnold 
(1963)，Moser (1962) and later authors, 人 olmogorov's result forms what ls now 
known as 天 AM theory. Here we give a proof of Kolmogorov's theorem and use 
it in studying the long-time behaviour of Symplectic numerical methods applied to 
perturbed integrable Systems. 


和 X.S.1 上 olmogorov's Theorem 


In 9ect.X.2.3 we have already given Kolmogorov's transformation which reduces 
the size of a perturbation to a Hamiltonian of the form (2.8) from O(s) to O(e2 ,at 
least formally. The iteration of that procedure is convergent and yields the following 
TeSult. 
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A.N. Kolmosgorov VI. Arnold3 


Theorem S.1 (Kolmogorovy 19S4). Comsider a 7eal-azmalytic Paailtomiazn (a, 0)， 
dejiredjpora 训 a1eigjppoxrpoodor 0 ER and0 ET9 Jporwjicpntpelinearization 
af 咏 一 0doesy71ofdepend ol 1 加 e da1zgley: 


刀 (owb)==c+w:a+garM(a0)a， (5.1) 
SUDpose 1Pafw E 陈 4 satisjies 1Ppe diopjpamtinre condifion (2.4) Viz， 
Euwl>yIET” Jor KEZ45 大 0， (5.2) 
Ga1d 1Pat tje angUiar average Jo of MI(0,) an ipvertipie dg xd7aatrixr: 
aoull > 则 ol Jor ve 了 R9， (3.3) 


Wit Positive co1lsla1nt ?1 ZLefr -(a0) = 万 (aw0) + sG(a0) pe a 7eal- 
G1G1ytic Perturpation or 万 (a,0). 7T1em 加 ee exists so > 0 suchn tpatjor every E 
Wi 玉 |s| so, tere  G11 aa1ytic sy11Dlectic tra1ts1jp7raation us : (opP) 记 (oa,0)， 
O(s) closeto tpeidenfipy anad dependizag aaatyticaliy on s wjicpn puis 妨 e perturped 
aatzlto1ia1z pack io te jor1 


已 (ab)=c+w:b+3O Neop)D Jr (oo 让 = 加 O) (6G.4) 


7T7Pe perturped syste1a 1Perejore jas 1Pe ivarianfiorusy 们 二 0, PETd carryinga 
GUGS1-Pe1iodic jiomw wii 1pe ya111e jjegue1iciey w ay 加 e UPe1tUrped yyste111. 

(7Tje 妨 reshnold so depends on diT,N ad on pot1ads of 克 and G oz a co111Plex 
1eigjhpoxrpood of{0} x T4.) 


2 Andrei Nikolaevich Kolmogorov, born: 25 April 1903 in Tambov (Russia), died: 20 Octo- 
ber 1987 in Moscow. 

3 Vladimir Igorevich Arnold, born: 12 June 1937 in Odessa (USSRI). 

4Jirgen KK. Moser bor: 4 July 1928 in Kbnigsberg, now Kaliningrad, died: 17 December 
1999 in Ziirich (Switzerland). 
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Of particular interest is the case when 五 (a0) = 五 o(a) is independent of 0， 
So that we are considering perturbations of an integrable System. In this case, the 
theorem shows that all invariant tori with frequencies w(a) = 05o/pal(a) satisfying 
($.2) and with invertible Hessian 92o/6a2(a) persist under Small perturbations 
and are only Slightly deformed. 

下 olmogorov (19534) stated the theorem and formulated the iteration of Sec- 
tion 入 .2.3, but did not give the details of the convergence estimates. Arnold (1903) 
gave afirst complete proof of the theorem for perturbed integrable Systems, Using a 
construction based on the “ultra-violet cutoff” (cf. Lemma 2.1) which yields a single 
transformation Simultaneously for all frequencies Satisfying the diophantine condi- 
tion (2.4), in contrast to Kolmogorov's iteration which yields a different transforma- 
tion for every choice of diophantine frequencies. However Arnold's transformation 
is no longer analytic in the perturbation parameter s. Moser (1962) Showed that the 
analyticity of the Hamiltonian can be replaced by differentiabijlity of suffciently 
high order Full proofs of Kolmogorov's theorem along his original construction 
were published by Thirring (1977) (for a reduced model problem) and by Benettin， 
Galgani, Giorgilli & Strelcyn (1984). 

As in Remark 4.0, a Practical difficulty with Theorem 5.1 is that the theoreti- 
cally obtained threshold so is very small. The proof below requires so 和 68” with 
Qa 一 十 4 十 1of Lemma 4.1, where bo is inversely proportional to 7. This pes- 
Simistic estimate of the threshold can be Somewhat improved by first reducing the 
perturbation of an integrable Hamiltonian System Via a perturbation Series expan- 
Sion as in the proof of Theorem 4.4 and then applying 上 olmogorov's theorem to the 
remainder of the truncated perturbation Series. 

The proof of Theorem 5.1 uses iteratively the following lemma，which refers 
to the transformation constructed in Sect.X.2.3. Similar to Sect.X.4 we use the 
notation 

lcls = supfle(wbli lal < wmbl< 颁 


for a bounded analytic function C on f， := Bv(0) x UVo, where again Bv(0) is the 
complex ball ofradius p around 0 and Lp is the complex extension of Ta of width 5. 
The same notation ls used for vector- and matrix-valued functions, In which case the 
underlying norm on Cd or Cdxd is the maximum norm or its induced matrix norm， 
Tespectively. 


Lemma S$S.2. 17 1e Situation ofsyect 和 2.3a1d zx11der tpe comaditionasy of7T1eore111 .1， 
SUDPOSe 11at 五 ad G a1e Teal-a11alytic ad Doxaded om 了 np Te 1pere exis1s 00 > 
0 SUcj 1jpat 1je jollowizg pox1nads Hold jor Ko111zogo1ov 8 1ra1zs1fo711atio1 WPe1eyver 


让 seGll < 55"， ten |e2Gll ss ( 引 )5 


1 
网 
and |sVxllo ss<5ae，-Mls<be， 


Where aw 三 Z 十 qd 十 1.77pe tprespold 600 depends on1y om dz Tand om | 五 |， 


420 X. Hamiltonian Perturbation Theory and Symplectic Integrators 


Proojf We estimate the terms arising in the construction of 玉 olmogorov's transfor- 
mation of Sect.X.2.3. For brevity we denote | .上 三 上 7sjafor7 = 0,1,2,3,4. 
(a) The transformation (b 2)  (a,0) is constructed such that (ao,0) = V(E)， 
where y(b) is the solution of = .JIVX(y) with y(0) = (0 P). Suppose for the 

moment that 
lsVxlls < 6 . (5.5) 


6 zy( 昌 一 y(0) < iofor0< 夺 <es,andinparticular 
(oa 9) 一 (bo 区 5. We define 








OX OX 
2 Re E 二 ee 
E 玉 (bpP) :一 (a 十 0 O) 0 一 OO 一 8 现 (b， p)) 


= _ ye)-y0) -<sJ7 一 VX((0)) 
and note 
1R( oO < 各 区 上 O < 圳 77Y2x TYxl 


So that 


| 民 la < 让 V2xlls1VYxlla (5.6) 


(pb) Tracing the construction of Sect.X.2.3，we find by Taylor expansion of 
妃 (ao,0) that the new matrix is 


MD， O) 二 MU O) 二 EL(D， O) 


WwWith 





dd 
有 DOM 0Xx 0M 0x 


Where 尸 (bp) is symmetric with 
大 POCO 一 MD) 元 
and where CQ(b, p) is given by (2.11). It follows that 
1 一 Ms 2e(lYMlalYxls+lY2Glo) G7) 
From the construction of G we also find by Simple estimates of Taylor remainders 


1Glas1YBlslRla+lYGlslYxlle+lY2ElslYxl .9) 


(c) Using Lemma 4.1 in the equations (2.12)-(2.16) defining X of (2.10)，we 
obtain first 


忆 9 
ol smoi ellcole， | 关外 <“lcolo 
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and by a Second application of that lemma, for z = 1,..….，d， 
lxslla 和 oo (ul 十 lola +llGalla) 


where, by construction of v and ， 


lo < ng | 到 | ， ep sehers(en 和 1Gih) 
了 一 1 


It then follows by Cauchy's estimates that 
上 Yxlss Co lcle， YXxlascCo leGlo. (5.9) 


(d) Combining the estimates (9$.60)-(3.9) and using once more Cauchy's esti- 
mates to bound derivatives of 妃 and C yields 


levxl， ss < Co-2elleGl。 
Mr- Ms < C6-2e-alleGl，. 


le Cl-s < CC6 ”eclp 


All this holds under the condition (3.3). By ($.9), this condition is Satisfited 下 


lsGll < 559 and5 < 0o with a sufficiently small 60. (Tracing the above constants 


2 


Shows that bo needs to be inversely proportional to Ai ”, or inversely proportional 


to Z.) This yields the stated bounds. 














Proofof 7jpeore1z .17. 下 olmogorov's iteration yields Sequences 


G(0 = GGGOGC) 
MD) = M MO MGC) 


XXX es。 


By Lemma 5.2 they satisfy, provided that ||sGll，= 659 with 6 < 60， 


le GO < (2 0) G.10) 
ICG+D 一 MO on < (2 70) G.1D) 
[0 G.12) 


where p0) = P 一 (1 十 二 十 .… 十 2-7)6 > dp foral7.Note that (5.11) implies 
that the inverse of MU) is bounded by 2U-1 for all 7 so that the iterative use of 
Lemma 3.2 is justified. The time-e2” flow of XGO) is a symplectic transformation 
ob), which by (5.12) satisfies 


leg 一 idllwy < (2-76)ae . (5.13) 
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The composed transformation 
雪 纪 ) 二 al0) O ab) 5 ab) 
is constructed such that 
厅 (VGT 信 阅 ) = ce 二 wb 二 MOPDTeGOO (0G.14) 


By (S.13), the sequence (2 OP) converges uniformly on 几 py2 X (一 so,so) to a 
limit We(b 2). By Weierstrass” theorem, We(b, p) is analytic in (0 2,Es) (and in any 
further parameters on which M and C might possibly depend analytically). Since 
We depends analytically on s and Wo = jd, it follows that We is O(s)-close to the 
identity on 你 py2. By (5.10) and (5.14), the transformed Hamiltonian 已 9 Us is of 
the desired form (3.4). 














X.S.2 下 AM Tori under Symplectic Discretization 


Consider a Hamiltonian System 


1 Bt 
下 2 (S.15) 


for which, in suitable coordinates (ao,0), the Hamiltonian 7t(D,9) = 五 (0) 十 
sGl(a,0) satisfies the conditions of Theorem 5.1. Kolmogorov's theorem yields a 
transformation to variables (2, P) in terms of which 


Tt(D,9g) 一 w :0 十 OAM(bP)b， 


so that the torus 思 = 也 = 0，P ETdj is invariant and the How on it is quasi- 
periodic with frequencies w. 

For a symplectic integrator of order p applied to (5.15), backward analysSis gives 
amodified Hamiltonian 7t(P,g) which is an O(j2P) perturbation of TDP, g): 


(dg) =w :2 二 HOTAM( ob 二 7PPG(O) (5.16) 
玫 olmogorov's theorem can be applied once more, yielding an invariant torus 2 
of the modiftied Hamiltonian Mt(2,q) which again carries a quasi-periodic fow with 
frequencies w. Combined with the exponentially Small estimates of backward analy- 
Sis for the difference between numerical solutions and the fow of the modified 
也 amiltonian System, this gives the following result of Hairer 儿 Lubich (1997). 


Theorem S.3. 7 1je apove situatiom, Jor da Sy1iDjectic integ1ator of order D 5ed 
Wi Scienty sl Step size 几 tere 1 0 7iodiied Paailioniaz 帮 Wi a1 太 - 
VG1ia1t 1O1U8 和 ca17y118 Ga guUaSi-peiodic Jow wy Jeguencies w， O(P) close 
10 1je iva1iant 1om3 To oj 1jpe original Pa1ailiomian 1 such 1pat tpe derence 
Dethwee1 G11y 11Ie1ical SO1Utiomn (Dn;gn) stati118 on 1Pe 1o108 人 da1id 1 加 e Sol1utiom 
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(Za 的 ) of te maodijied Panazltomia1z syste1a wii 态 e SG11ie Sta1ti118 Values 1e- 
1I01118 EXDO1E1iGL1y S111G11 27 工 / 尹 over expo1ie1tial1y 1o118 111e8: 


| gen) 一 (六 昌 , 了 9b) 咱 < Ce or 二 = 仿生 er 人 


7Tje cosla11s C ad A a1e idepe1deat ormn PE (orPssttjicienty s12a11) ad of 
ijpe zitial value (po,qo) E 元 


Proof (a) For sufficiently small 九 , Kolmogorov's theorem applied to (5.16) yields a 
change of coordinates (b 2) 上 (c 轨 ,OU(P) closetotheidentity, which transformas 
the modiftied Hamiltonian to the form 


Oo， d) 一 ww C 十 汪 c7 Me nec， 功 )c ) 


with the invariant torus 元 ,一 {c = 0,， WE Te}. The corresponding differential 
equations read in these coordinates 


e=u(eW， =w+o(cO (5.17) 


where ulc 动 ) = O(cl2 and v(cV = O(cel), and similarly for the derivatives 
Du/ac = O(cl), auw/6wm = O(cl 妆 ,and ou/ac = OU), ov/6w = @O( 人 cl). 
The constants in these O-terms are independent of 几 and <s. Let (c 人 (, 沙 ( 坟 ) and 
(CO)， 从 人) be two solutions of (5.17) such that |c 扩 | 和 B, cg < 6 0 suff- 
ciently Small) for al 上 under consideration. Then, an argument based on Gronwall s 


lemma shows that their difference is bounded over atime interval 0 < 上 <1/6by 


人 (le(0) 一 So)l+ BIO) 一 Yo) 
9 的 一 VEC 人 le 一 So 上 20) 一 (ol ， 


for some constant C that does not depend on O, 几 or <. 

(b) mm the following we denote y = (D,9) for brevity, and more specifically， 
yn denotes the numerical solution starting from any yo on the torus 元 ， lj.e., the 
c-Coordinate of Vo0 vanishes: co = 0. We denote by V( 纺 s,2)] the solution of the 
modifed Hamiltonian system with initial value V(s,s,z) = 2, and more briefly 
z 坟 =V(t0,yo) the solution starting from yo. By Theorem IX.7.6, the local error 
of backward error analysis at 万 一 J7P is bounded by 














(5.18) 





| 2 三 三 -1 51 < 0 := Comst. 彤 e 一 36/ 


for some constant 上 , as long as Vj remains in a compact Subset of the domain of 
analyticity of 1. We further denote the c-coordinates of yy V(b and yy( 坊 廊 ,27) by 
cn; ci) and ct 姓 ,V1), Tespectively. To apply the error propagation estimate (5.18)， 
we asSume that 

and for all 7 satisfying 三 三 思科 1/0. This assumption will be justified by induc- 
tion later, and the value of 0 will be specified in (3.21) below. By (S$.18) we thus 
obtain the bound 
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7 人 节 27) 一 V 攻 -VECU+L 一 力 ))6 for 考生 1/0. 


Summing up ffom7 = 1tom gives for 如 《tt<1/0(andt>2) 


信 扩 各) 一 攻 < CUL+G 雪 )5< Chr15( 人 十 细 一 经 /2) 
7 一 1 


< Cj-10 刀 < Ch-I15162 . (5.20) 


We now Set 
DB= (2C1-10)173 ， (5.21) 


sothat Ch-16162 = 8/2,and we obtain the desired estimate from (5.20) by putting 
上 一刀， 

(c) We Still have to justify the assumption (5.19). This will be done by induction. 
For7 = 0 nothing needs to be shown, because clt 0,Vo) = C 提 三 0 as aconse- 


quence of the fact that yb stays on the invariant torus Tu = {fc = 0，W E T4}. 
Suppose now that (3$.19) holds for 7 冬 m.Itthen follows from (5.20) that 


et) 川 < Cj 6 =B2 for 如 Et 1/0 
(again because of c(t) 三 0). Consequently we also have 
|easall ss en 一 各 + 志士 ro 川 <o+B/2<P， 


provided that /is suffciently small so that 0 < 0/2. By continuity, cf 如 +1, yn+1) 
is bounded by O on a non-empty interval [ 妃 +1,7T5+1il .The computation of part (b) 
Shows that | cf 恕 + yn ECB/]/2 onthis interval. Hence,T，+1can be increased 
until Tr+1l 过 116. This proves the estimate (5.19) for 7 一刀 十 |. 














X.0 Invariant TIori of Symplectic Maps 


In the preceding section, backward error analyslis combined with Kolmogorov's the- 
orem has Shown that a Symplectic integrator applied to a Hamiltonian System with 
术 AM tori possesses tori that are near-invariant，up to exponentially small terms， 
Over exponentially long times in the inverse of the step Size. To obtain truly invariant 
torl, we need a discrete 天 AM theorem for perturbations of integrable near-identity 
maps depending on a Small parameter, the Step Size. Such a Tesult was recently ob- 
tained by Shang (1999, 2000), who gave a discrete Arnold-type construction. Here， 
we use instead a discrete-time version of Kolmogorov's iteration. This establishes 
the existence of invariant tori of symplectic integrators applied to integrable Hamil- 
tonian Systems or to near-integrable Systems with 天 AM tori, for a Cantor Set of 
non-TeSonant Step Sizes. 
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X.6.1 人 A 下 AM Theorem for Symplectic Near-Identity Maps 


We consider a discrete-time analogue of the Situation in Sections X.2.3 and X.3.1 
and construct the corresponding version of Kolmogorov's iteration. Consider the 
symplectic map on : (a,0) mm (20) foranear 0 < 月 %, 0 E Ta defined by 


人 =a 一 (w 骨 ， 0 


| 
起 
十 
六 
| 
< 呈 
袜 


(6.1) 


where his a small parameter (the step size)j, and 9 : 妃 (0) xT4 一 及 is areal- 


Ps 


analytic generating function. If 9(a,0) has the form (cf (2.8)) 
5(ow 由 =c+w:a+3a7M(owga， (6.2) 
then the associated Symplectic map is of the form 
人 =a+O( 则 ol， 9=0+jo 二 Oleal) 


Hence, the torus {a = 0，0 <E T4} is invariant, and on it the map oj reduces to 
Totation by APw. 

Consider now an analytic perturbation of such a generating function: 9S(a， 分 币 
ER(a， 作 with a small =. We construct a near-identity Symplectic change of coor- 
dinates, via an iterative procedure Similar to Kolmogorov's iteration of Sect. 和 .2.3， 
Such that the generating function of the perturbed Symplectic map in the new vari- 
ables ls again of the form (6.2) with the Same w, and hence the perturbed map has an 
invariant torus on which it is conjugate to rotation by Ahw. This holds if pw satisfies 
the following diophantine condition (cf. (2.4)): 


> 六 ET for KeEZd 尖 0， (6.3) 


本 本 Ce 一 2 
用 





for some positive constants ?,Z*; and 让 the angular average Mo of M(0,.) is 
invertible: 
INooll> 必 ol for we 耿 (6.4) 


for a positive constant / 必 . As in 9ect.X.2.3，we construct a Symplectic transfor- 
mation (o, 0) 一 (bb P) as the time-s fow of an auxiliary Hamiltonian of the form 
(2.10), Viz.， 


Q 
X(oPp) = 二 .PP 二 Xo(O) 十 芝 DiXi(p) 


where E E 了 2 is a constant vector and Xo,X1)...,Xa are 2r-periodic functions. 


一 、 


We then consider the map conjugate to the perturbed map (oa,0) 上 (Q,0) generated 


by S(awb) +sR(a 0: 


一 、 


(0) 一 (00) 


(2 P) (要 
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We construct X in Such a way that the above composed Symplectic map is generated 
by 95(0D) 十 2 民 ( 六 内 with 9 of the form (6.2) and both 9 and 尺 real-analytic and 
bounded independently of s and of 屎 with (6.3). The map (D, P) 一 ( OD) is then of 
the form 


= 十 OAOUOE 全 =o+ 凤 十 O( 则 人 二 One2) 


As an elementary calculation shows, this holds 让 X satisfies for all (六 站) with 0 
near 0, 方 ETY 


几 





| 仿 


志方 一 pu) 二 2 及 =CTO() 





Where Cn does not depend on (b, DO) and s. Writing down the Taylor expansion 


地 
oO) 一 Po(O) 十 人 玉 忆 (区 十 OU 


and inserting the above ansatz for X, this condition becomes fulfilled 让 with w(OD) 三 
M(02Eandv(D) = MO 六 (oOXo/0p)( 太 一 jh)， 


Xo( 的 一 Xo( 信 一 jw) 





十 BRo(O) = PR 《0.5) 


Xi(O) 一 Xi( 方 一]o) 
用 


直 二 元 于 过 三 1 G=1. .dg (67) 





十 Li(D 十 ui( 人 十 已 (人 三 灭 十 元 十 (6.60) 


Where the bars again denote angular averageSs. We note 


_ 一 和 .jw 
工 一 e 记 


Xo( 方 ) - Xo( 方 一 jw) 泡 
邦 





庆 - 
X0O, C 
天 


where Xok are the Fourier coefficients of Xo. Under the diophantine condition (0.3)， 
Equation (6.9) is thus solved like (2.14) under condition (2.4). Equations (6.0) are 
of the Same type. The above System is then Solved in the Same way as (2.12)-(2.10)， 
yielding that the perturbed map in the new coordinates, (b, P) rm (b, D),is generated 
by 

SO 人 网 一 c 十 wp 十 友 MO( 六 十 cRO(bO 


一 : 


with unchanged frequencies w and with MU D) = MU DJ 二 O(e).The pertur 
bation to the form (6.2) is thus reduced from O(s) to O(s2). By the same arguments 
as in the proof of Theorem 5.1 it is Shown that the iteration of this procedure con- 
Verges. This proves the following discrete-time version of Kolmogorov 's theorem. 


一 、 


Theorem 6.1， Comsider a 7eal-a1zaalytic jazzction 9(a,0) oftpejoraf6.2)mwii(6. 人 让 
dejfined on ad meighpboxrjood of{0} x Ted. LerlJ < Po(fpososmzalltpatr(6.7J)5a 
Well-dejijzed 110D) G11G SUDPose 态 at Po safisjiey (6.3). 
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ZLet 59-(a， 分 一 9(a,0) 十 ER(a， 分 De a1 G11G1yfic Perturpation of 9(a;,0)， ge1- 
eati1g GSy11Diectic IaD oh sc : (a,0) > (0， 从 Via (6.1) mi 砚 9 友 Place of9. 

7je1m 1jere exisf so0 > 0 such tatjorevemy swm 识 | < so 加 ereian aaatytic 
Sy11Djectic 1a1s1p7maation Vs : (六 P) 上 (aow0) O(s) close 1o 1je identity 7 
Jiy 万 产 Satishig (6.3) amd aaalytic 太 E SUcP Pat 7 oohneowhneci(bD) 呈 
( 扩 ) 8 8e1erated Via (6 人 2y Gaction 9 (DO) wjich ns again of tje 加 7 
(G2) ie， 

人 人 (D) 一 cue 十 wD 十 Me(bD) 


71e Pertu1rped 11aap ah is 加 elejoe jas da1z 10VG1ia121 1o7US Oo wjicpn ii3 co11U8ale 10 
1olatio7z Dy Acw. 

(7T1e 如 jesjnold so depe1ids oly oa dz da1d oz Dot1adys or9 ad 尺 on 0 
comzDlex meigjhpoxrpood of{0} x Ta.) 














X.6.2 Invariant Tori of Symplectic Integrators 


Asadirect consequence of Theorem 6.1 we obtain the following result on invariant 
tori of Symplectic integrators applied to 天 AM Systems. 


Theorem 6.2. 4PPAy da 8Sy11Plectic integrator of order Dp 1o a perturped integraplie 
SySte1a Wi aaKAHM 1or THjicpn carriesy da guUaSi-periodic jiom wiip diopjpam1tine 
Jpeaguecies w. T1e1 矿 1pe step Size 屎 1 SUHciety S11G1L Ga12d Satisjies Ine stro118 
1701-1eSO1G11Ce CO11Gi1io11 (6.3) 1Pe 11aerical 11etpod jas Ga11 iva7ia1at 1O170S T 
GO(hP)-close 如 人 ,on wjicm 坟 1 co1jugate 1o 1otatior py jw 


Poof Theorem 6.1 applies directly, with s = AP, to the above Situation. Here， 


the generating function 9(a,0) of the time- 疙 how 2 of the Hamiltonian system 
with the KAM torus Tu is of the form (6.2) in the variables (oa, 0) obtained by Kol- 


一 、 


mogorov's theorem. The matrix MT(a,0) in (6.2)then differs from the corresponding 
matrix of (2.8) by O(P), so that (5.3) implies (6.4). Finally, the generating function 


一 


of the numerical one-step map G@j is an O(j2)-perturbation 9(a, 0) 十 12RR(a， 作 . 














X.6.3 Strongly Non-Resonant Step Sizes 


Theorem 6.2 leaves us with an interesting question: 让 w E 玉 4 is a vector of frequen- 
cies that Satisfies the diophantine condition (2.4), then which step Sizes 疡 satisfy the 
non-reSonance condition (06.3)? Here we give a lemma in the Spirit of results by 
Shang (2000). It shows that the probability of picking an 凡 E (0, ho) satisfying 
(6.3) tends to ] as Po 一 0. 


Lemma 6.3. Szppose w E 形 4 safisjiesy (2.4) and 1ef ho > 0. For any cjpoice of 
Posilive ?7 a11d 7 te set 


2G(jpo) 三 {E(0,j1o); doesnotsatisA (6.3) 
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8 ope1z a1d de1zge 态 (0,710) 还 人 和 Yand 六 > 二 dr 了 rr > 二 加 en tjpe 
ZLepesgtue 1ieasu1e of Go) 1 potaded by 


measure (2Z(jpo)) < C 二 而 号 


Where C depends oa1iy oz dz azd ||w||. 


Poof Itis clear from the definition that 2G(jPo) is open and dense in (0,/o). 

remains to prove the estimate of the Lebesgue measure. For every KE Zd and 

|P 和 Po, there exists an integer / 一 !( 有 , 几 ) Such that 
2 


2 
上 1 一 ee 二 全: 一 2rl 一 二 大 |: 必 一 
克 


2Tr 
Il 





FEor this / we must have, by the triangle inequality， 
2r| 咱 和 并 二 | 中 Po lo， 


So that in case7 夭 0 





1 过 Ao |lw|| 
ov 业 这 
中 ”2r( 人 (| 一 二 ) 
On the other hand, ! = 0 yields 





吧 2 
之 二 IE WwW| 之 二 7 
克 元 


1 一 一 庆 'Pw 
人 





which implies 刀 和 G(jo). Hence, 六 can be in 2G(jpo) only 证 there exist  E Zd， 
天 天 0andaninteger/ 和 0suchthat 


炒 








了 


2Tr1 TI 风 入 


天 | 世 
大 可 2 天-w 天 ， 


王 3 到 


不 人 取 十 存 一 Zr |w| 1 亿 
二 二 | 刀 + 4 | 70f+1 . 











| 入 


It follows that 


不 的 十 和 一 以 W I 人 
measure (2Z(jpo)) 和 2 >》， 3 | 于 (学 二 > ) 7+T ， 
30 ! 双 0 2 





> 











which yields the stated result. 





和 .7 上 Xercises 


1. Let 尺 beaadx2admatrix ofrank qw. Show that there exists a symplectic 20 x 2d 
Imatrix 4 such that R4= (已 @) with an invertible d x d matrix 书 . 
届 AL Consider first the case g = 2 and then reduce the general Situation to a 
Sequence of transformations for that case. 
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.04 [| error in 和 1 





eigenvalues 





5000 10000 和 ? 15000 






入 3 












error in 和 2 


error in 和 3 








Fig. 7.1. Numerically obtained eigenvalues (left pictures) and errors in the eigenvalues (right 
pictures) for the step sizes 玉 三 0.1 (dotted) and 玉 三 0.05 (solid line) 


The transformation (z,y) 一 (zy 十 dz,y)) is symplectic 让 and only 这 the 
partial derivatives of d satisfy dz = 民 ，dv = 0. 

In the Situation of Lemma 1.1, 让 (， ,ao， 人 Go)7 is another Such 
Symplectic transformation, then there exists a Smooth function 人 depending 
only onZz = (Z1,...,zZd) such that, for zj 三 上 (D,9)， 


Bo-Gog= 有 四， 





ZL Use the previous eXercise. 

Show that every discrete Subgroup of Rd is a grid, generated by 有 < d linearly 
independent VectorS. 

olLutio1. See e.g5. Arnold (1989), Sect. 10D. 

Show the following bound of the Lebesgue measure of non-diophantine fre- 
quencies (Arnold 1963): for any bounded domain (2 C 了 2， 


measure{w < 有 ; w does not satisfy (2.4) withy > dj < C(dD)7 . 


恺 AL For a fxed 5, decompose w = wo 十 aQKE/|E| with wo .天 = 0. 

Show that the eigenvalues Aj of the matrix 也 of the Toda System are first inte- 
grals in involution. 

1 For 用 三 det(X7 一 也), Show that { PP 已 三 0for al 入 ,1. 

We repeat the experiment of Fig.1.3 with the Stormer-Verlet Scheme，where 
we keep the initial values for the 9-variables, but change the initial values for 
the Dp-variables to pl1 = Da = 23 三 0. The numerical results, given in Fig.7.1， 
are qualitatively different from those in Fig.1.3. The errors behave more like 
Ac(tp) rather than jc(b. We do not understand this behaviour; do you? 

Show that for a non-Symplectic numerical method, there is at worst quadratic 
error growth in time when it is applied to an integrable Hamiltonian System. 
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Consider a numerical integrator of order Dp (ie., Bj(y) = opn(y) 十 O(PP+T))， 
and assume that 
殉 ( 思 7 = 了 十 OO) 


with g > D, when the method is applied to a _ Hamiltonian System. Prove that 
under the assumptions of Theorem 3.1 the global error behaves for 上 = AR like 


名 一 的 =OM2) 二 O( 人 ja)， 
and the action variables like 
TUn) 一 To) =OUD) 十 OU 


RemaK. Methods Satisfying the asSsumptions of this exercise are called pserdo- 
Sy1iplectic of order (D, q) (Aubry 廊 Chartier 1998). Pseudo-symplectic meth- 
ods behave like symplectic methods on time intervals of length O(PZ-9). 
Using the theory ofB-series, in particular Theorem VI.7.4, derive the conditions 
for the coefficients of a Runge-Kutta method such that it is pseudo-Symplectic 
of order D(q). Prove that there exist explicit, pseudo-symplectic Runge-Kutta 
Imethods of order (2,4) with 3 stages. 


Chapter XI. 
Reversible Perturbation Theory 
and Symmetric Integrators 


There is a very close Similarity between the behaviour of solutions of 
Ieversible Systems and that of Hamiltonian ones. 
(M.B. Sevryuk 19806, p.3) 


Numerical experiments indicate that Symmetric methods applied to integrable and 
near-integrable reverSsible Systems Share Similar properties to Symplectic methods 
applied to (Cnear-)integrable 了 amiltonian Systems: linear error growth, long-time 
near-conservation of first integrals，existence of invariant tori. The present chap- 
ter gives a theoretical explanation of the good long-time behaviour of Symmetric 
methods. The results and techniques are largely analogous to those of the previous 
chapter 一 the extent of the analogy may indeed be seen as the most Surprising feature 
of this chapter. 


XI.1 Integrable Reversible Systems 


We consider a System of differential equations on a domain of 开交 x 琢 "”， 


忆 王 Ju) 


人 


jw 一 J) 二 一 上 ww， 2) 


区 2 
WO 2 


From Sect. V.1 we recall that the time- fow wr of areversible System ls a 7eyversiple 
7111GD: 
ta U) 去 (t， 加 implies PP (ui 一 V) 过 ( 一 1) 

A coordinate transform = AU(D, 四 ,uv = (ZUV) is Said to preserve yeversipinity 让 
the relations 

HA 一切 二 AP, 

Z(2Z， 一 ) 一 Z(Z, 2 
hold for all (z, y). This implies that every reversible system (1.1) written in the new 
variables (z,V) is again reversible, and that every Teversible map (UV) 卢 (区 如 


(1.3) 
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expressed in the variables (Z,y) again becomes a Ieversible map (z,V) 请 (元 从. 
Conversely, (1.3) is necesSary for these properties. 

For Hamiltonian Systems，complete integrability ls tied to the existence of a 
Symplectic transformation to action-angle variables; See Sect, X.1. For reversible 
Systems， we take the existence of a reversibijlity-preserving transformation to Such 
variables as the definition of integrability. 


Definition 1.1. The System (1.1) is called an itegrapie reyversiple Syste1a 于 ,for 
every point (Wo,Vvo) E 下 内 x 下 ”in the domain of ( 刻 9), there exist a function 
w=(wl.,wn) :万 一 了 "andadiffeomorphism 





劝 王 (1 万 xx 下 一 TCRxR: (ao 一 (WU) 


(with 刀 and V open sets in 下 普 and 形 亚 x 下 ", respectively, and (wo,vo) E D)， 
Which preserves reverSibility and transforms the System (1.1) to the form 


w = 0 d4 
王 w(a) . 
We Speak of a 7eal-a1zalytic Integraple 7eversiple yyste111 让 all the functions appear- 
ing in the above deftinition are real-analytic. 


了 xample 1.2 (Motion in a Central Field). In Examples X.1.2 and 和 .1.10 we con- 
structed action-angle variables via a series of transformations 


X.1.5 X.1.0 X.1.153 
d1,22 7 Do Ce 也, 志 4 二 ) 也, 了 
D1, 92 4%, Dr 2 02 0027 
It is easijly verified that all these transformations preserve Ieversibility. They trans- 


form the reversible System 


骨 三 DPI， 2 三 一 42 (r)Ar 


如 三 D2，D 三 一 MA 0 
(with 7 一 V0 十 9 to the form 
万 =0， 五 =0 
人 (1.0) 


with 下 一 了 (万 , 了 and 瑟 = 下 (万 ,也 given by (X.1.12) and (X.1.13). 


As the following result shows, it is not incidental that the above transformations 
preserve TeverSslbility. 
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Theorem 1.3， Zie syitxatiomn of tpe 4Faold-Lioxyille teore1 7T1eore1z 和 1.6 1et 


1je Jrst Integ1aly 上 .Ed ortpe completely 01eg1aDie a11atliomia1z syste111 De 
SUCP Ia1 GO Fi G1e eye1l jiu1actio11s of ipe seco11d air of tpe arg11e111: 
Fi(UVU) 三 瑟 ( 一) (=1 .9d) (1.7) 


SUDPpose 1hat 0 有 /0 .OFo/6u ae iearmy idependent eyerywjpere (om 
UL :ZE 了 hexcept possipiy on a sefl 1jpat 1as 11o interior poin1s. Furtper 
0SSU11Ie 11at jprevemyZ E 也 thereexistusucpn tpar(u0) E MTpem te trans- 
jpzaationm : (oa;0) 户 (U) io action-a1gle variaples as give py 7Tjeorem 和 7.6 
PD1ese1ves 1eye1sIDILEDy 


Prooj The result follows by tracing the proofs of Lemma 和 .1.1，Iheorem X.1.4 
and Theorem 和 .1.0. 

(a) For 天 satisfying (1.7) and at points where the Jacobian matrix OFV/Ouw is 
inyertible, the construction of the local symplectic transformation 2 = (局 ,Eu， 
G1, ,Gu : (wu) m (zyV) Shows that the generating function 9(z,v) becomes 
odd in v when the integration constant is chosen such that 5(z,0) = 0. By (X.1.4)， 
this implies that 上 preserves reversibility. A_ continuity argument used together with 
the essential uniqueness of the transformation 上 (See Exercise X.3) does away with 
the exceptional points where OFV/buis singular. 

(pb) In Theorem X.1.4, the construction of e(z,y) = py(l-I(z,0)) =: (wu) is 
Such that 

e(D- 几 二 -oz0)) = (一 
This holds because by asSsumption the reference point on AM can be chosen as 
tr1(z;0) = (wo,0) for some wo, and because p+y is the time 士 1 fow of the 
也 amiltonian System with Hamiltonian V1Z31 十 .… .十 Vddd. Condition (1.7) implies 
that this ls a Teversible System, which in turn yields that e preserves TeverSibility as 
Stated above. 

(c) The transformation in the proof of Theorem X.1.6 is of the form a = (zZ)， 
4 三 到 (z)0 (with invertible 太 (z) = (z)) and hence preserves reversibility. 




















了 xample 1.4， We now Present an example with 
one degree of freedom where Theorem 1.3 does 


not apply. In fact, all conditions are Satisfied “十 
except that for Some 2Z there is no Such that 
(0) E Mz. We consider the Hamiltonian 4 > 
| | ， ， 











Eeem=O2-D2+ 人 srD4ds 叶 | 


Its level sets are Shown in the Picture to the 

Tight. For energy values Such that the level curve ee 
does not intersect the -axis, Theorem 1.3 does 
not apply even though 囊 (wuw) satisfies (1.7). 上 
For these energy values the System is an in- 


tegrable Hamiltonian System, but not an inte- 
grable reversible System . 
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了 上 xample 1.S (Motion in a Central Field，Continued). All the assumptions of 
Theorem 1.3 are satisfied for = 瓦 , za 三 了 = 三 Di 一 Dogl 让 wetake the 
Symplectic coordinates 凡 = (qi,p2) andv = (一 D1,92). 

The condition (1.7) is also satisfied with 书 = 五, 肛 王 有 (全 夭 0asalways) 
for the choices 炎 = (pl,pz) anduv = (qi,qzj,oru = (qq) andv 一 (一 D1, 一 D2). 
However in these Situations, Theorem 1.3 cannot be applied, because there does not 
exist Such that (0) E WI-. 


kxample 1.6 (IToda Lattice)，Consider the Toda lattice of Sect.X.1.5. The eigen- 
values of the matrix Zare first integrals in involution. With the Symplectic coordi- 
nates (UV) = (qd, 一 D) the Hamiltonian system corresponding to (X.1.17) satisfies 
the reversibility conditions (1.2). However Since v1 十 . .十 vn jls afrst integral of 
this System, it is not possible to connect (V) with ( 汉 一 V) on a level set Ma and 
Theorem 1.3 cannot be applied. 

Fortunately,，as can be Seen in Fig.1.1, the TIoda lattice contains many more 
Symmetries. With periodic boundary conditions it ls, for example, p-reverSsible (il.e.， 
pf = 一 /poy),y = (pq)7i ,seethe discussion in Chap.V) with 


和 


Where 9 inverses the components of a vector. To bring the System to the form (1.1) 
with a vector field satisfying (1.2), we transform 9 (and hence p) to diagonal form 
and collect the variables corresponding to the eligenvalues 十 1 and -1 in and v， 
Tespectively (See Exercise 1). This gives the (Symplectic) coordinates 





7 人 】 人 】 
UK 一 一 十 Dn- un_k+l 一 一 一 0n 5 
内 \ 万 OOK mn 一 K 十 1 也 一 大 十 工 \ 亡 dK dm 一 大 十 1 疝 
开 \ 亡 GdK dm 一 K 十 1 也 一 大 十 工 vv 了 Dm 一 K 十 1 DPK 几 


for 有 三 1 ...),/2 (fmniseven; forodd7 三 2 十 1 (1.8) holds for 天 一 1 .1 
and in addition we have We+l 三 pp+l and we+l 三 qe+1)， 

In the following we restrict our considerations to the case7 三 3, and we Show 
that all asSsumptions of Theorem 1.3 are Satisfied，so that we have an integrable 
Teversible System. For 7 一 3, the new variables are 





启 ( 十 四 ) 记 (4 一 g 
41 三 一 一 十 22 一 ) 9 一 ) 
国 -万 D1 十 D3 2 二 D2 3 d1 一 93 
(4+ 9 亏 (局 
11 三 一 一 十 22 一 ) 昌 3 过 ) 
万 dl1 十 43 2 二 42 3 23 一 刀 1 
and the expressions ak and pk of Sect.X.1.3 become 
万 (一 oj esp( 了 7 放 
al 三 一 一 天 | ui 一 23 |)， 呈 expl 二 (一 (1 十 3) 一 22 外)， 
1 2 1 3 7 DP 7 -人 1 3) 2 
Q 到 一生 D 三 二 张 (ie ( u))) 
2 一 1v2， 爷 了 了 P 了 N\Z2 \ 万 1 包 ) 
2 


os 一 一 (二 o)， b03 一 





XI.1 Integrable Reversible Systems 441 


































ss 


上 
8 
SS 
SS=Z 
沪 / 


























Fig. 1.1. Three projections of the solution of the Toda lattice equations (m” 三 3) with initial 
values as in Fig.X.1.3 


One sees that 好 十 好 and ai 好 十 as 好 are even functions ofvw, so that all coefficients 
of the characteristic polynomial of the matrix 也 


X( 和 A) = 三 中 (al 十 aa 十 a3) 和 2 一 (ala2 十 aza3 十 asal 一 好 一 妈 一 ZJ) 入 十 


(ala2a3 a102 a203 bs aa 有 十 20120223) 


are even in v. This implies that also the eligenvalues of 上 are even functions of v, SO 
that (1.7) ls Satisfied. 

Itremains to prove that for fxed Z, i.e., for given real eigenvalues of ,the point 
(&o,Vo) corresponding to p(0),d(0) can be connected with an element of the form 
(0) E 下 5 without leaving the level set M2. Equivalently, we have to find such a 
path for which the corresponding coeffcients of the characteristic polynomial X( 入 ) 
take given values. For given v( 芭 this yields a system of three nonlinear equations 
for vv(b E 了 月 3. For the eigenvalues corresponding to the initial values p(0), qd(0) 
used in Fig.X.1.3, we putv( 雪 =votforl > 寺 > 0and wechecknumerically with 
apath-following algorithm that such a connection is possible. 


上 xample 1.7 (Rigid Body Equations on the Unit Sphere). We ITeconsider an ex- 
ample that has accompanied us all the way through Chapters IV, V, and VIIL.3: the 
rigid body equations (TV.1.4), here conslidered as differential equations on the unit 
Sphere. We assume 713 < 1 1 for the inertia, which implies that any Solution Start- 
ing with ys(0) > 0 will have ys 人 加 > 0 for all . We consider the equations in the 
neighbourhood of such a solution. We can then choose 内 三 ,三 V2 as coordi- 
nates on the upper half-sphere {22 十 好 十 邮 = 1 8 > 0}. This gives the reversible 


System 
人 一 aoVIL 一 了 2 一 V? 
站 一 aouUV1T 一 2 一 2 
WwWith Q1 一 (72 as 73)/7273 > 0 and Q2 一 (7 一 厂 )/73 五 所 0， Which has 囊 一 
42/ 五 +v2/ 有 + 如一 o)/ 太 = ao 一 ai 十 万 :asaninvariant We 
introduce polar coordinates 王 7cosw%,u 一 7Sinwandexpressy as afunction of 
瑟 and w: 


(1.9) 
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厂 一 刀 
7 一 了 和 
al sin op 一 Q2 COS2 


This leaves us with differential equations 








媚 二 0， 史 二 TU OP)， 
where y ls even in 2 and has no zeros. The time needed to run through an angle ” ls 


| 2Tr 
(避风 = 元 末 而 业 ， and 4 二 二 0 


is the frequency. With 0 = w( 万 )7( 互 ,p) we then have 
妃 =0，0= w(D). 


The transformation from (,v) in the open unit disc (except the origin) to ( 妃 ;,0) <E 
(0,73 xTis adiffeomorphism that preserves reversibijlity. This shows that the 
Tigld body equations (1.9) are an integrable reversible System. 


Example 1.8 (Rigid Body Equations in 及 3). We now consider the rigid body 
equations (TV.1.4) in the ambient space 琢 3, rather than on the unit sphere. The 
System then has the invariants 刀 = 902/ 万 十 妇 / 雪 /7 and 开 三 好 十 好 十 邮 ， 
and it is reversible with respect to the partition = (Wi,ys) andv = V2. Pm the 
case 13 < 万 ,7 we can again restrict ouUT attention to V3 > 0. We then write 


= 一 V 开 一 内 一 妇 andintroduce polar coordinates Wi = cosp, ya 一 7sin 0 
As above, we expresS 7 as a function of 妃 , 人 and 2 (this just requires replacing 
13 1 with 开 / 7 in the above formula for r) and we obtain differential equations 


瑟 =0， 开 =0， 乡 =T( 可 天 9) 
with y even in pp and without zeros. In the Same way as above, this is transformed to 
刀 =0， 开 =0，0=w 人 (万民 ). 


The transformation ((i,ys),y2) 这 (( 万 , 开 ),0) preserves reversibility. The rigid 
body equations (IV.1.4) are thus an integrable reversible System. Note that this time 
the dimensions differ. 


XI.2 Transformations in Reversible Perturbation 
Theory 


We consider perturbations of an integrable reversible System Such that the perturbed 
System ls still reversible. This takes the form 
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4 王 cr(a,0) 
(2.1) 
6 = waj+ep(b 
where s is a small parameter andr is an odd function of 0 and p is an even function 
of 0: 

7(a， 人 0) aa 7(a， 0) 

Da, 一 0 一 pa,0) 
Similar to Sect.X.2 for Hamiltonian perturbation theory, we Study coordinate trans- 
formations that change (2.1) to reversible Systems which - in various ways - look 
closer to an integrable System in action-angle variables than (2.1). 


(2.2) 


XI.2.1 The Basic Scheme of Reversible Perturbation Theory 
We look for a transformation between neighbourhoods of {ao} x "， 


Q 一 0 十 ss(bD) 
0 = 好 ca(D， O) ) 





(2.3) 


Which preserves reversibility and hence has s even in % and oa odd in %, Such that 
the transformed System is of the form 


b = O(e2) 
| C 有 
六 = w( 人 二 eu) 二 O(E2) . 


JInserting (2.3) into (2.1) gives the System 
TI 0 -1(8s/8b Bs/an DAN cr(a,0) 
0 了/ ”\8c/8) 8c/pp 六) 一 wa 上 +sp(ag) 


with (a,0) from (2.3). Inverting the matrix on the left-hand side and expanding in 
powers of s, it is Seen that (2.4) fequires that s, oa Satisfy the equations 


(po = 人 CC 
0 
区 和 0 Cg 


A _ necessary condition for the Solvability of (2.3) ls that the angular average of 7 


Vanishes: 
F(b) 一 0， Where 7(Db) 三 mL 7r(b oO) do ， (2.7) 
(27)7 .Tv 
In the Hamiltonian case this condition was Satisfied because 7 was a gradient with 
Tespect to 2. Here, in the reversible case, this is Satisfied becauser is an odd function 
of 2. 
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开 (2.7) holds, then (2.3) can be Solved by Fourier series expansion in the Same 
Way as We Solved (X.2.2), provided that the frequencies wl ( 世 ,...,wn(b) are non- 
resonant. Of course, there is again the Same problem of Small denominators as in the 
也 amiltonian case. Equations (2.0) are Solved in the Same way as (2.3), Upon Setting 


AD) 元 丰 十 必 人 (站 可 四 C.8) 


Since 7 lgs odd in w, the solution s of (2.5) becomes even in p. It is determined 
uniquely only up to a constant: we are still free to choose the angular average 5(D). 
开 w'( 刀 has rank 7， we may actually choose 5(b) Such that HA(b) = 0 results from 
(2.8). Since the right-hand Side of (2.0) is even in , the solution o of (2.0) becomes 
odd in 2 让 we choose 5(I) = 0. 


XI.2.2 Reversible Perturbation Series 


The above construction extends to arbitrary finite order in s. The transformation is 
now Sought for in the form 


a 一 0 十 ssl(Dp) 十 s2sa( 六 D) 十 .十 EL1sNw_1(DOD) (2.9) 


0 -ioNw_1(b oO) (2.10) 


P 十 sol( 妨 DO) 十 soo( 力 P) 十 :十 < 


with sj even in p and oj odd in wp to preserve Teversibility. This transformation is to 
be chosen Such that the System in the new variables ls of the form 


D Se EN7rN(D， O) 
六 = wew( 人 十 ce pv(DhD) 


with we N( 人 一 w( 人 十 sh 人 (人 十 .十 ELUN_1i(and with rw(b p) oddin 
and ON(D,P) even in p, and with all these functions bounded independently of <. 

JInserting the transformation into (2.1) and expanding in powers of s, it ls Seen 
that the functions sj and oj must satisfy equations of the form of (2.5),， (2.0): 


束 人 oo = 7 人 0 
用 信 风 oO = 而 他 且 Two 人 一 W 国 CC 


Where Dj Trj are given by expressions that depend linearly on higher-order deriv- 
atives of > D and polynomially on the functions si,oi with 7 < 7 and on their 
first-order derivatives. Using the rules 


even 0dd odd 、_ 1/ odd 
odd even even / Neven 


and 
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Oeven Oodd 








一 even ， 


itis found that Dj is odd in p and ri is even in wp for all 7. For non-resonant frequen- 
cies w(b, the equations (2.11), (2.12) can therefore be Solved with sj even in %, ar 
odd in p. 开 w'(o) is invertible, we can obtain Ji(D) 三 0 for all 7. 

Beyond these formal calculations, there is the following reversible analogue of 
Lemma 和 .2.1 in the Hamiltonian case. This result is obtained by the same “ultra- 
Violet cut-off” argument as the earlier result. 


Lemma 2.1. Zetl ie 7igHt-Ha1d Side jpctio1ls of (2.7) pe 17eal-a11a1ytic 0 aeig7- 
Dox11pood of { 了 及 XT7 and Satis 有 请 (2.2) SUppose 1pat w( 姑 ) satisjies 加 e dio- 
PHa1ztiie Co1a1dition (X.2.4 For ay Jixed N 之 2 1jere are positive co1181G1115 
E0,CC SUCc1 1Patf 1je jliowing jholds jpors 芯 so: 1jere exis1g a 1eal-a1a1ytic 
1eyve1SiDility-preserving cjpa1zge of coojdizaates (a,0) 上 (DP) Suc1n tat every So- 
ZU1iom (bpP( 鸭 ) or ipe Perturped syste1a 唐 1pPe em coordizaatfes， Stati11g WitP 
12(0) 一 天 | < cllog el 一 1 satisjies 


2 的 一 MOSCte” Jor te 

le 的 一 weN(MO)E 一 2(OEC (要 十 革 logsl 人)s rr 相生 es 
Moreoyver 1je trans1formaation 1 O(E)-close io 1jpe identip (aow0) 一 (oO < Cs 
jpolds jpor(a;,0) and (bpP) 7elated py te apove coo1dizaate tra1s1oz1t /or 一 六 | 和 
cllogsl- 汪 1 andjporp 加 ans-independent complexr 1ieigjpbpoxrpood orT7". 


771e co1yta1atsy so c, C depe1d on NT Gd on po1ds of wm 0 om a co11- 
Plex 1eigjppoxrpood of{ 姑 上 x 开 7 














The equations determining the coefticient functions of the perturbation Series 
are of the form to which Lemma X.4.1 applies. Therefore, that lemma is again the 
tool for estimating the terms in the perturbation series, Similar to Sect.X.4.1. This 
yields a reversible analogue of Theorem 入 .4.4 Showing near-invariance of tori (Up 
to exponentially Small terms in a negative power of =) over time intervals that are 
exponentially large in a negative power of es，with the Same exponents wa, 0 as in 
Theorem 和 .4.4. 


XI.2.3 人 Reversible 天 AM Theory 


For an integrable reversible System, just as for an integrable Hamiltonian System， 
the phase space ls foliated into invariant tori on which the fow is conditionally 
periodic. We fix one such torus {a = ao" 0 E IT"} with diophantine frequencies 
w1,...,wn. For convenience we may assume 一 0 E 慌 .This torus is invariant 
under the flow of systems of the forma = O(lal2,0 = ww 二 OO 人 (alh, or written 
more explicitly, 

4 = 507K(ab)a 


2 (2.13) 
0 = w+AM(a0)a. 
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Here, 开 = [KE ., 开 mm] where each Ki(a,0) is a symmetric 7 x 70 matrix， 
and M(a;g is an7 xy7m matrix. The first equation is to be interpreted as ww = 
3a7 Ki(a， 0)a for the components 1 = 1,...,70. Consider now a perturbation of 
this System: 
4 = 节 aoK(awb)a+er(a,b) 
0 = w+M(awoba+esp(a0) . 


FEor the reversible case, je., for andr odd in 0gO andtfor Mandpevenin 0， 
we construct a sequence of reversibility-preserving transformations in the Spirit of 
Kolmogorov's transformation of 9ect. 和 .2.3，which transform (2.14) back to the 
form (2.13) in the new variables, Showing the persistence of an invariant torus with 
frequencies wi under Small reversible perturbations of the System. This holds again 
under the diophantine condition (X.2.4) on w and additionally under the condition 
that the angular average Mo of M ata = 0 has rank m. A result of this type 一 
a reversible 人 AM theorem - was Shown by Moser (1973), Chap.V, in a different 
Setting. See also Sevryuk (1986) for further results in that direction. 
We look for a transformation of the form 


(2.14) 


Qa 一 b+el(s(p) + 5(O) C.15) 
0 = 2 二 ea(O) 
with an mm x 7n matrix 9S(p). Preserving reversibility requires that s and 9 are even 


functions and co is odd. Higher-orderterms inpplay no role and are therefore omitted 
from the beginning. We insert this into (2.14) and obtain 


1 os 
b = 3 天 (p)8 二 ef{r(0,P) 5 


-着 0p- 囊 (Mo- 贡 (spjbjo+sorEOo' 


+O(e) 二 OGlol) 





史 = w 二 MoP)D 
+efpo 由 -各 oo+MOas(O}+oco+oclon)， 


We require that the terms in curly brackets vanlish. This holds if the following equa- 
tions are Satisfied (the last equation ls written component-wilse for notational clar- 


ity): 


Ds 

一 7(0, p) 

Our 

殉 () = 0(0,P) 二 MU0Pp)s(P) (2.16) 
09i Or; Ds; 








(2)w 三 5 2 OOM(0 人 Tsk(PJRK(O 
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Since7 is odd in ,the first equation can be Solved for s even in , uniquely up to a 
constant, the angular average 5. Since the angular average of M is asSsumed to be of 
full rank ,5 can be chosen Such that the angular average of the right-hand Side of 
the equation for co becomes zero. Since the right-hand side is even, the equation can 
then be Solved uniquely for an odd c. The equations for 9 have an odd right-hand 
Side and can therefore be Solved for an even 9. 

In this way, the perturbation to the form (2.13) is reduced from O(e) to O(c2). 
By the same arguments as in the Hamiltonian case (See Sect.X.3), the iteration of 
this procedure ls Seen to be convergent. This fnally yields a change of coordinates 
that preserves ITeversibility and transforms the perturbed System (2.14) back to the 
form (2.13). We Summarize this in the following theorem，which is the reversible 
analogue of Kolmogorov's Theorem 入 .5.1. 


Theorem 2.2， Co7zsider a 7eal-a1aalyfic 7eyversiple 8yste111 (2.13). UDPPose 1nat w E 
了 ”satisjies 1e dioppa1atipaie co1adifion (和 .2.4 da12d 1hat 加 e da118UGr average OF 
朵 (0 Banam xy7mn 71aatrix orrakm. ZLer(2.74) De da 71eal-aaalytic eve1siple per- 
17pation or 1je yyste11 (2.73) 7T1e11，1pere existy s0 > 0 (jzcjn depe1ds o1 如 e 
PetUrpatio1a Jactions opy 妇 1o8P Ga Do1d of 1Peir 11o11118 oO Q Co11DLex 11e18 凡 - 
Dor7jpood or{0} xT2) sucp tpatjporevemysw 丰 上 ll 近 so tperemareal-anaatytic 
1ra1s1o11aation We : (六 opD) mm (a0) O(e) close 1o tpe identipy and depending da1- 
Gyticaliy o1 s，WAichn Pieserves 7eveFSipility G11d PUIS 加 e PetU1rped syste11z pack 10 
态 e Jo (2.73) 训 ze new yariables: 了 = @O 人 2) 扫 =w 二 OCTpe perzurped 
SySte111 态 erejoje has tpe ivariantiorasy 人 三 0， PET7"} carrying adguasi-periodic 
Jowm miiU1je sa1te jeqgue1cies w ad38 1 太 e UDPerturped syste111. 














XI.2.4 Reversible Birkhoff-Type Normalization 


We Show that, in the Situation of diophantine frequencies w, there is a reversibility- 
preserving transformation that takes a reversible System of the form (2.13) to the 
form 


D 7kE(D， O) 
0 = ww 十 GO) 十 pk(D, P) 


withp rp 一 GO 人 (名 (2.17) 


for arbitrary 8 之 2, where 人 三 万 十 .十 友 -1 with the bars denoting angular 
averages and with pi1(b p) = Mb P)D. This implies again that the invariant torus 
is“very sticky” ||b(0)|‖ 和 6 implies | < 26 for 上 < Cr6-e+l As in the 
Hamiltonian case, a Suitable choice of 5 would even yield time intervals exponen- 
tially long in a negative power of 0 during which solutions stay within twice the 
initial distance 0. 

The transformation to the normal form (2.17) is constructed recursively. Suppose 
that in some variables (ao, 0) we have, for some 天 二 2， 


一 7j_li(a;,O) 
with 7K 1 Dk1 一 O(llal 5) 
必 十 Ck_1(a) 加 加 DPK_1(a， 0) 
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Note, for 有 = 2 wehaverl = O( 人 all by (2.13). We search for a transformation 


Q 二 十 s(D, OP) 


0 一 9 十 a(0 OP) 


with sa=O 人 >) ， 


(ands 一 @O( 人 (al2 for 天 = 2) that preserves Teversibility, i.e., has seven in po and c 
odd in 2, and is Such that (2.17) holds. Inserting the transformation into the above 
differential equation Shows that this ls indeed achieved 让 s,o solve the following 
System of the form (2.5), (2.0): 


5 2O)w = Ti O) 
O 


ev 

SS 

Sa 
| 


= Di-i(b,O) 十 ci()sGP) 一 Ab ， 


Choosing 5(b) 0 leads to /大 三 友 _1and gives (2.17) with 入 三 6-1 十 友 1. 


XI.3 Linear 上 Error Growth and Near-Preseryation 
of First Integrals 


We now Study the error behaviour of reversible methods applied to integrable re- 
versible Systems. Recall from Theorem V.1.5 that Symmetric methods are reversible 
under the compatibility condition (V.1.4). We give an analogue of Theorem X.3.1 
on the error behaviour of symplectic methods applied to integrable Hamiltonian 
Systems. We consider an integrable reversible System (1.1) (usually not given in 
action-angle variables) and let (uv) = W(a,0) be thereversibility-preserving trans- 
formation to action-angle variables. The inverse transformation ls denoted as 


(og)= Wouo edoo)) ， 
The following is the reversible analogue of Theorem 和 .3.1. 


Theorem 3.1， Co7zsider GaPPL1ying a Feversiple 11ierical integ1atorofrorderDtiotpe 
111eg1GDUe 1eyve1siple SySte11a (7.7) wii 17eal-anaalytic 718H11-Ha1d Side. UDPose that 
w(a”)] satisjiesy tpe diopjpamtiae coadition (和 2.4). 7T1em tpPere existpositive Co1181a1a15 
CCca1nd Po SUcP IJattpejoiiomwing hol1ds Jorallstep Sizes 几 去 10: every AU1ierical 
SOLU1io1 SIG11i8 Wi 瑟 |7T(uoyuo) 一 or < cllogP 一 1satisjies 


| on) 一 (wb vb) 
(un own) 一 了 Tuo,zo) 


所 CA 


过 Cj Jor 夺 = 7 由 区 7 姑 了 (3.1) 


| 
| 
7T71e co1zsta11y o,c;C depend on TD or(X2. 逢 on 1Pe dijae1sio1g，o1 Dox1dsy of 
1jpe 7eal-a1za1ytic Jinactions 户 9g oz a comap1ex 1ieigjpoxrpood ortpe lorus {(VU) : 
T(Lu) 三 0 上 azad om tpe aaerical 1zetpod 
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Prooj The proof of Theorem X.3.1 relied on Theorem IX.3.1 and Lemma 和 .2.1. 
Using their reversible analogues Theorem JIX.2.3 and Lemma 2.1 with the Same 
arguments gives the above result for the reversible case. 














Remark 3.2. As in the analogous remark for the Hamiltonian case, the error bounds 
of Theorem 3.1 also hold when the reversible method is applied to a perturbed in- 
tegrable System with a perturbation parameter s bounded by a positive power of the 
Step Size: E 所 人 Re2 for some aw > 0. 


We consider the Hamiltonian System of Example 1.4 and apply the Symmetric 
but non-Symplectic Lobatto IIIB method with step size 玫 = 0.01. In the left picture 
of Fig.3.1 we choose the initial value (wo,vo) = (0,1.5) for which the level curve 
of the Hamiltonian is symmetric with respect to the -axls and the System is an inte- 
grable reversible System. The good conservation of the Hamiltonian is in agreement 
with Theorem 3.1. In the right picture we choose (wo,vo) = (0,0.3) whose level 
curve ls the fat line in the picture of Example 1.4 which does not intersect the -aXis. 
Since in this Situation we do not have an integrable reversible System, Theorem 3.1 
cannot be applied and we cannot expect good energy conservation. 














9< .000002 上 
| [ | Sa 

.00000 250 500 .000000 50 500 
| Lobatto IIJB 
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忆 
-.00002 上 ?oauro DR 


-.000004 王 




















Fig. 3.1. Numerical Hamiltonian of Example 1.4 for two different initial values 


For the IJoda lattice example, Figures 3.2 and 3.3 illustrate the long-ttme con- 
Servation of the first integrals and the linear error growth, respectively, of the Lo- 
batto IIB method. 

Theorem 3.1 together with Examples 1.7 and 1.8 also explains the good behav- 
iour of symmetric (in fact, reversible) integrators on the Tigid body equations which 
WwWe observed in Chap. V (Figs.V.4.2 and V.4.0). 


Variable Step Sizes: Proportional, Reversible Controllers. As a consequence of 
the backward error analysis of Theorem IX.6.1 the statement (3.1) can be extended 
Straightforwardly to proportional step Size controllers as discussed in Sect. VII.3.1. 
Under the assumption of Theorem 3.1 with 彤 and Apo replaced by s and so one has 


| om 一 (ja) 诈 和 C 如 E 


for 妃 巡 E 2. 3.2 
lzGosan) -Taovaoll < Cez 全 
The grid { 妃 } is determined by the method and satisfies 怒 +1 三 刀 十 ss(Wny UnyE). 


Variable Step Sizes: Integrating, Reversible Controllers. We apply the backward 
error analysis of Theorem IX.0.2. The modified equation (IX.0.14) reduces to 
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RN 册 
5000 10000 15000 Du 


Fig. 3.2. Numerically obtained eigenvalues (left picture) and errors in the eigenvalues (right 
picture) of the 3-stage Lobatto IIB scheme (Step Size 咏 = 0.1) applied to the Toda lattice 
with the data of Sect. 和 .1.5 
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Fig. 3.3. Euclidean norm of the global error for the 3-stage Lobatto IIB Scheme (Step Size 
玉 三 0.1) applied to the Toda lattice with 即 = 3 and initial values as in Fig. 3.2 


= 帮 y)， 之 一 2CG(V) (3.3) 
业 


fore = 0.Since G(U) = -(c(J) Va(y)7 AGO) with an analytic step size func- 
tion ol， the function (yz) mm zol(Vy) is a first integral of (3.3). Suppose now 
that y = jy) is the integrable reversible system (1.1). This means that there exists 
a Ieversibility preserving diffeomorphism y = W(a,0) transforming the System to 
action-angle variables. The diffeomorphism 


局 = Wo 4,0) = 二 


is then also reversibijlity preserving 这 cl(uw, -v) = al(wu), and it transforms (3.3) to 


4 = 0， 4=0， 0 = w(a). 
Ithebasic method of the algorithm (IX.6.9) is reversible and if olu, -vv) = al V) 
holds, the modifed equation (IX.6.14) ls a reversible perturbation of (3.3). Conse- 
quently, Theorem 3.1 yields the statement (3.2) also for integrating Step Size con- 
trollers. Since 4 := zoluwV) is an action variable, we have in addition that 


zolunyun) 一 200(uo,uo)| 挟 Ce? 


for 如 冬 s ?2. Notice that the transformation (2.9) is O(sP)-close to the identity 
for the variables a and 0, but only O(e2)-close for 4. This result proves that the 
integrating step Size controller ls as robust as the proportional controller. It also 
explains the excellent long-time behaviour observed in Figs.VJII.3.2 and VIJIIU.3.3. 
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XI.4 Invariant TIori under Reversible Discretization 


In this Section we Study the question as to how invariant tori of reversible Systems 
are preserved under discretization of the System by reversible numerical methods. 
We give ITeversible analogues of Theorems 和 .5.3 and 和 .0.1. 


XI.4.1 Near-Invariant TIori over 了 上 xponentially Long Times 


We conslder a reversible System (1.1) which in suitable coordinates takes the per- 
turbed form (2.14). Under the conditions of the reversible 开 AM theorem,， Theo- 
rem 2.2, this System has an invariant torus carrying a quasi-periodic fow with fre- 
quencies w for sufftciently Small =s. Consider now a reversible numerical integrator 
applied to this System. By the Same arguments as in Sect.X.3.2, Using the reversible 
KAM theorem 2.2 in place of Kolmogorov's Theorem X.3.1，we obtain the fol- 
lowing analogue of Theorem X.3.3，which states the existence of a torus Such that 
numerical Solutions starting on this torus remain exponentially close to a quasi- 
periodic fow on that torus over exponentially long times in 1/ 几 . 


Theorem 4.1， 7Z 加 e apoyve situatiom, Jor ad 1eyversipie 71U11erical 11etpod of order 7D 
SeEd Wi Scienatty S11G1L Step Size 几 加 ee 好 G711odijied Jeversiple Systema with da1 
ivG1ia1it ior28 Tca117yiz8 aqgUaSi-periodic Jow mi Jeguenciegs w O(P) close 
10 态 e iva1iat ioFds To or tpe original reversiple sySte11， SUCL tat 1pe dijjerence 
Dethwee1 Ga1y 11U1ie1ical SO1UOH (Un Un) Sta1tiza8g on te 1orusy 人 a1d 加 e solution 
( 尼 人 5 扩 ) of ie 71aodiied ailtomiam syste1a wii tpe sa1ae startizag values 1e- 
1I01118 EXDO1E1iGL1y S111G11 7 工 / 尹 over expo1ie1tial1y 1o118 111e8: 


| oa 一 (TD < Ce Jor = 区 es 


7Tjpe co1sta1t C ad Ka1e idepe1dest ojs(1orPEssSztjicientiy SG10) aad oftpe 
态 友 GaL Value (UV0) E 元. 














The case of initial values lying close to, but not on 元 ,can again betreated by a 
Ieversible analogue of Theorem 和 X.4.7. 


XI.4.2 A 下 AM Theorem for Reversible Near-Identity Maps 


To obtain truly invariant tori，we need a discrete analogue of the reversible 上 AM 
theorem,， which is derived in this subsection. This result can also be viewed as the 
Teversible analogue of Theorem 和 .6.1. It establishes the existence of invariant tori 
of reversible integrators, but as in the Symplectic case, only for a Cantor set of non- 
TeSonant Step SlZeS. 

AmapS5:(a0) 一 (0， 作 has the invariant torus {a = 0,，0 Ee 菩 } ,andreduces 
on this torus to rotation by Apwo (Rareal parameter and w E 及 "”),， when it is of the 
form (cf. (2.13)) 
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= Q 十 直 pa7K(a0)a 
= 0 十 ji 十 PAM(a,0)a . 


Here, 开 = [天 , 开 m] where each Ki(a,0) is asymmetric mm x 7m matrix, and 
1M(a,0) is an7 xm matrix. The expression in the first equation is again to be 
interpreted as aL Ki(a,0)a for the components 1 一 1 ... ,77. 

A necesSsary condition for the above map 2 to be 7eyversiplie with respect to the 
involution (oa,0) m (一 外 ,cf Deftinition V.1.2, is seen to be 


(4.1) 





民 ) S) 





开 (0, 下 = 一 开 (0,0 一 Pu) d9) 
MI(0,-0 = MI00,0-jPo). 
Consider now a perturbed map 
下 = 十 3pa7K(a0)a 十 尹 sr(a)g) C43 
0 = 0+Hh 二 Paga 十 几 Esp(a;O 


whereyr and 0D, which like 人 and M are assumed real-analytic, might depend ana- 
lytically also on 六 and s. Reversibility of this map implies, by direct computation， 
that in addition to (4.2), the following equations are satisfied up to an error O(AE): 


7(0, 一 0) = 一 r(0,0 一 Po) 
Dr Dr 
吉 人 一作 = 一 元 (0.0) (4.4) 


0(0,--0) = 0000--Ppo) 一 APAMM(00-- Po)r(000 一 Po). 


Similar to Sect.XI.2.3，we construct a teversibility-preserving near-identity trans- 
formation of coordinates (a,0) 一 (bp) such that the above map Ge in the new 
variables is of the form (4.3) with the perturbation terms reduced from @O(s) to 
O(c2). Similar to Sect.X.6.1, this is possible 计 huw satisfies the diophantine condi- 
tion (X.6.3) and if the angular average Mo of M(0,.) has rank 7. 

We look for the transformation in the form (2.15). The functions defining this 
transformation must Satisfy the following equations, cf. (2.10): 


s(P 十 Phw) 一 SO) 














二 7(0,P) 
(2 十 。， 0 
9ij(P 十 、 一 Sr(P2) 元 (o-》、 全 (P)Mji(0, 四 ) 
+ sk(O)Kaej(0,O) 


Under the conditions (X.6.3), (X.0.4) these equations can be Solved by Fourier ex- 
pansion, in the Same way as the analogous equations in Sections X.0.1 and XI.2.3， 
and the map in the variables (b, 2) becomes of the form 
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= 8 二 32K(2P)b 二 Oncol2) + O(Rns?) 


4.6 
= oj 十 PM(PD+OUell) 上 Ons2) ， 9 


G) 避 ) 


We still need to know that the change of variables (oa, 0) 上 (b P) preserves Te- 
versibijlity, ji.e., that s and 9 are even functions of po and c is an odd function of 
0O. This ls indeed a consequence of (4.2) and (4.4). (We may modify r and p Such 
that (4.4) holds exactly, at the expense of introducing additional O(P2e2?) perturba- 
tions in (4.3).) Let us Show this property for s. The Fourier coefficients sk of s must 
Satisfy 


CKPw 二 可 
Since (4.4) Imnplies r = 一 rke- 久 nu forall ,itfollowsthats 人 一 shandhence 


sis an even function of o. Similarly it is shpown that 9 is even and co is odd. 

In summary, we have found a transformation @O(s) close to the identity, which 
transforms the reversible map (4.3) to a reversible map (4.0), thus reducing the per- 
turbation terms from O(e) to O(s2). The iteration of this procedure can again be 
Shown to be convergent. This fnally yields a transformation to coordinates in terms 
of which the perturbed map is back in the form (2.13). In this way we obtain the fol- 
lowing discrete analogue of Theorem 2.2 or reversible analogue of Theorem 和 .0.1. 


Theorem 4.2. Comsider a 7eal-a1z1alytic 7eve1siple 11aD 91 or1pe Jo (4. 3 de- 
Jiped on a71ieigHhpoxrjpood of{0} xT wii0E 了 7 Suppose 1pat ho satisjies 1he 
diopja1atize co1adition (X.6.3) ad 1Pat tpe a118UIar ave1age or MI(0,) jas 7a1K 刀 . 
7Tjem 1pPere existy so0 > 0 Suc1 tatjorevemy swi 太 | < so terezareal-anaiytic 
1ra1s1p11aation We : (0PD) 记 (ao0)，wjicp preservesy Teversipilipy ad 1 O(5) 
close 10 1jpe identipy 2 加 may 下 六 SG18jyin8g (和 6.3) Gd 1 CGO1C 六 s SUCP 芒 at 
0 oggGjpsowjisi 人 (ip) 记 (ODNagain oftjpejora (4 一 二 OO 人 (2)， 
护 =V 十 Ai 十 OO 人 (由 Te pertrped aap Ge 加 erejoje jas az ivariantiorus o1 
WPicp ii cojzgate 1o 1olafio1z py 几 w. 














As in the analogous Situation of Sect.X.0.2, Theorem 4.2 applies directly， with 
E 三 /2, to the Situation where a reversible numerical method of order P is used 
to discretize an integrable reversible System，or more generally,，a reversible SyS- 
tem with a 天 AM torus with diophantine frequencies w. Here (4.1) corresponds to 
the time- 几 flow of the reversible System, and (4.3) represents the numerical map. 
This establishes the existence of invariant tori for feversible integrators, in perfect 
analogy to the Symplectic counterpart Theorem 久 .6.2. 

Concerning condition (X.0.3) werefer backto Sect.X.0.3, where it is Shown that 
this condition is satisfied for a Cantor set of step Sizes 丸 让 w satisfies the diophantine 
condition (X.2.4). 


XI.S 卫 Xercises 


1. This exercise Shows that reversibility with respect to the particular involution 
(u) F (u 一 v) is not as Special as it might seem at first glance. 
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(a) If the system 7 = jy) is po-reversible (ie.,， Foy) = 一 JW)), then the 
transformed system 之 一 了 -LTz) is o-reversible with co = 人 -107. 

(b) Every linear involution (02 = 门 is similar to a diagonal matrix with en- 
tries 十 1. 





.Consider the IJoda lattice equations with an arbitrary number 7 of degrees of 


freedom and with periodic boundary conditions. 

(a) Find all linear involutions po for which the System is D-Teversible. 

(b) Study for which p the eligenvalues of the matrix 了 are even functions of V. 
(c) Investigate (numerically) the set of initial values for which all the assump- 
tions of Theorem 1.3 are Satisfied for some involution 0. 

ZL Generalize the discussion for 光 一 3inthe Example 1.0. 


. Areversible System of the form 


wu 一 0 

0 = w(a,0 
with w an even function of O E IT, also has a foliation of invariant tori. Con- 
Slider reversible perturbations of Such Systems like in (2.1) and search for a 


reversibility-preserving transformation (2.3) that takes the perturbed System to 
the form 


= O(c) 
2 = wp)+snuP)T+O() 


with /even in 2. Write down the partial differential equations that the transfor- 
mation must Satisfy and discuss (Suffcient) conditions for their Solvability. 


.The torus {a = 0,0 Ee 要 is invariant and carries a conditionally periodic 


flow with frequencies w for reversible systems of the forma = O(lal), 6 = 
w 十 O(|al, which is more general than (2.13) in the differential equation for a. 
Discuss the difticulties that arise in trying to transform a reversible perturbation 
of Such a System back to this form. 


. Apply an arbitrary (non-symmetric) Runge-Kutta method of even orderDp = 2 


to an integrable reversible System. Prove that under the assumptions of Theo- 
rem 3.1 the global error behaves fort = 7 尹 like 


甸 一 8 的 王 OU 十 O( 人 (PPT )， 
and the action variables like 


Tg) 一 To =OUODD 二 OGA)， 


Chapter 和 X 工 . 
Dissipatively Perturbed Hamiltonian 
and Reversible Systems 


Symplectic integrators also Show a favourable long-time behaviour when they are 
applied to non-Hamiltonian perturbations of Hamiltonian Systems. The Same ls true 
for Symmetric methods applied to non-reversible perturbations of reversible SyS- 
tems. In this chapter we Study the behaviour of numerical integrators when they are 
applied to dissipative perturbations of integrable Systems,， where only one invariant 
torus persists under the perturbation and becomes weakly attractive. The Simplest 
example of such a System is Van der Pol s equation with Small parameter, which has 
a Single limit cycle in contrast to the infinitely many periodic orbits of the unper- 
turbed harmonic oscillator. 


XI.1 Numerical Experiments with Van der Pol's 
了 上 quation 


One of the first such methods is the method of Van-der-Pol. [.…] It should， 
however be noted that in the formulation given by Van-der-Pol, approxi- 
mation was effected by Simple intujitive Teasonings. 

(N.N. Bogoliubov 多 YA. Mitropolski 1961, p. 10f.) 


Consider Van der Pol's equation 
= 一 gg+s( 一 02 
4 二 edL 一 多) 4.D 
4 一 了 


with small positive s, which is a perturbation of the harmonic oscillator. A Symplec- 
tic change to polar coordinates D = V2acos0 qg9=V2asin0 puts the System into 
the form 


s2acos20(1 一 2asin20) 
0 = 1+scosgsinb(l 一 2asin20) . 


Since the angle 0 evolves much faster than w，we may expect that the averaged 
Syste11, Which replaces the right-hand side functions by their angular averages, gives 
a good approximation: 
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au = sa(1-a) 
0 = 工 . 


Approximating by the averaged equation ls the“method of Van-der-Pol”cited 
above, and the belief in the long-titme validity of such an approximation is the ve 六 
Ga8g118 DJiiciplie. The averaged differential equation for wc has an unstable equilibrium 
at zero, and an asymptotically stable equilibrium at au” = 2. The averaged System 
therefore has the circle { 必 = 2,0E 有 mod2rlasanattractive limit cycle. This 
Suggests that the original Van der Pol equation has a nearby limit cycle，which is 
indeed the case. 

Following the numerical experiment of Hairer 儿 Lubich (1999),， we Solve the 
equation (1.1) with two initial values, (zo,qo) = (0,1.3) and (po,9qo) 三 (0,2.7)， 
and with three numerical methods: the non-Symplectic explicit and implicit Euler 
methods, and the symplectic Euler method. All of them have order 1. The numerical 
Tesults are displayed in Fig.1.1. For large Step Sizes (Compared to the perturbation 
parameter s), the non-Symplectic methods give a completely wrong numerical So- 
lution，whereas that of the Symplectic method is quajlitatively correct. For smaller 
step Sizes, the numerical solutions of the non-Symplectic methods also Show a limit 
cycle. 

For the moment we explain these observations by “simple intuitive reasoningsS”， 
that is, by the averaging Principle and formal backward error analysis. The rigorous 
treatment is developed in the course of this chapter in a more general framework of 
perturbed integrable Systems. 

For a differential equation 


= jy) +s9g(y)， 


the numerical solution yn obtained by the explicit Euler method is the (formally) 
exact Solution of a modified differential equation 


= 7 信 十 cg( 信 一 反 太 及 /+O( 人 2 二 sj 


For the Van der Pol equation in the above coordinates, the averaged modified equa- 
tion becomes 
G = ha 二 sa(1l 一 二 0) 十 


which has approximately w = 2 十 21/s as an equilibrium. Hence, the limit cy- 
cle of the numerical solution of the explicit Euler method has approximate radius 
2V1 二 jh/s (Fig.1.1) which is far from the correct value unless 刀 < 祥 2. 

The implicit Euler discretization is adjoint to the explicit Euler method. There- 
fore, its modified differential equation is as above with /replaced by 一 /. In this 
case, the radius of the limit cycle is approximately 2V1 一 jj/s (for < s), which 
again agrees Very well with the pictures of Fig. 1.1. 

For the symplectic Euler method, the modified differential equation for Van der 
Pol's equation is 
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EC explicit Euler 一 RN 


implicit Euler, 尹 二 0.15 











N 

















Symplectic Euler， 彤 一 0.02 


ES 


























Fig. 1.1. Numerical experiments with Van der Pol's equation (1.1), s = 0.05 


方 = 一 Ye 人 (一 部) 六 十 所 方 + O(12 十 sj) 
7 = 方 - 所 +O(02 +eh. 


Here, the modiftied differential equation for the unperturbed harmonic oscillator ls 
也 amiltonian (Theorem IX.3.1), and so all s-independent termas in the averaged mod- 
ified equation vanish: 
2 
0 万 ; 
一 一 (a0)ad0=0. 
0 90 (Q， ) 

Therefore, the radius of the limit cycle is of size 2 十 O(P) in accordance with 
Fig. 1.1. 
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XIL2 Averaging Transformations 


Le problsme des oscillations non linkaires a actuellement une grande 
importance dans les domaines les plus divers de la technique et de la 
physique. Parmi les meEthodes analytiques deEtude des oscillations non 
linkaires, la mekthode asymptotique de dk6veloppement en skrie par rap- 
port a un parametre petit est particulierement efficace. Toute une SErie 
de monographies publikes en 1930-1938 par N. Krylov et N. Bogo- 
lioubov tant en russe qu"en francais ont EtE consacrkes ja cette ques- 
tion, malheureusement ces ouUvIages Sont devenus aujourd'hui des raretes 
bibliographiques. Par ailleurs les meEthodes exposkes ont ktk largement 
developpkes depuis. 

(N.Bogolioubov &I. Mitropolski 1962, prEface alatraduction francaise) 


In this section we conslider rather general perturbations of integrable Systems. We 
study transformations that eliminate the dependence on the angles in the pertur- 
bation functions, up to arbitrary powers of the small perturbation parameter. The 
construction and properties of these “averaging "transformations are obtained by a 
Slight extension of the arguments in Sections X.2 and XI.2. 


XII.2.1 The Basic Scheme of Averaging 


As in Sections X.2.1 and XI.2.1, we consider perturbations of an integrable System 
written in action-angle variables: 


Rs Er7(a,0) 


C.D 
0 = w(a)+sp(a0) 


where sis asmall parameter and 7, p are real-analytic in aneighbourhood of {a* } x 
T4. Unlike the situation of the previous chapters, we do not impose conditions that 
make the angular average 





了 
有 的 王 0) qd0 2 
ra -人 ro CD 
vanish identically. We look for a transformation to new variables (b, 2), of the form 
一 十 ss(b， 
(2 号 ) oa 
0 = 三 2 下 sa(b， O) ) 
which eliminates the dependence on the angles in the O(s) terms of (2.1): 
D = 人 十 O(e2 
s7m0(b) 十 @O(E) 用 


六 = w( 人 +TeHn(O 二 O(c2) . 


This is just a minor modification of the problem in Sect. XI.2.1. The equations that 
s and o must satisfy, differ from (XI.2.3) and (XI.2.6) only in that the right-hand 
Side 7(b 2) of (XI.2.5) is replaced by 7(b 2) 一 7m( 世 ,viz.， 
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完 0 C3) 
东 ( 虽 册 六 Cg 


Necessary conditions for solvability are now 
mm( 切 二 7(O， AU 二 拟 D) ， C.7) 


Where the second equation corresponds to the choice 5(Db) = 0. mn other words, the 
leading terms in (2.4) are the angular averages of the perturbations in (2.1). 

The equations (2.5), (2.6) are Solvable for bp = 久 让 w(0) satisfies the dio- 
phantine condition (X.2.4). The “ultraviolet cutofft ”argument of the proof of Lem- 
ma X.2.1 then Shows that (2.4) holds uniformly as long as the solution remains in 
the ball | 一 巡 || < cllog sl 一 with asufficiently small constant c. This may hold 
over a very long time interval 过 the equation 一 <s70(D) has a stable equilibrium in 
that ball. 


XII.2.2 Perturbation Series 


As in Sections X.2.2 and XI.2.2, the above construction extends to arbitrary finite 
order in s. Atransformation of the form (XI.2.9), which eliminates the angles in all 
termas up to order sN-1l, is sought for: 


b = sal(D) 十 s27na( 思 十 .十 EN-ImN_1( 人 十 ENrN(bO) 


2.8 
交 妆 而 阁下 和 大 仙人 (0 下 玖 和 和 全 NOR。 


The equations determining the transformation are a slight modification of (XI.2.11) 
and (XI2.12): on the right-hand side of (XI.2.11), pj(b, p) is replaced by the differ- 
ence Dj(b, P) 一 ?nj(D)， with rz(b) = 万 (o). We then have the following variant of 
Lemmas 久 .2.1 and XI.2.1. 


Lemma 2.1. Zetl 11e 7igHt-Ppa1d Side jpctio11sy of (2.7) pe 17eal-a11a1ytic 记 a 711eig7- 
popood or {z} x Td4. Suppose tat w( 姑 ) safisjies tpe dioppantine comdition 
(X2. 和 wii expone1t .Foraiy Jixed N > 2 1jere de posifive co1sta11ts s0, CC 
SUCP 1Pat tje Joliowing jpolds fjor |s| 冬 so: tpere exists da Jeal-a1na1ytic cpa1zge of 
coo1dipatesy (a,0) 请 (0 2) wjicp ta1as1o7ras (2.7 太 1 (2.8) mi 


lzoS CA SC 


or 5 一 基 | <5， 
Ilrw@o< CA5x-，lpwol<cix- 和 用 -1 


Wje1e 
6 一 cllogsl 一 !. (2.9) 
Moreoyver 1pe tra1as1formaation 1 O(E)-close io te identip lowg9) 一 (oO < Cs 
jpoldsjJor(a,0) and (bpP) 7elated py te apove coo1dipnate 1ra1zs1orzza 加 or 一 六 | 忒 
6 a1nd jpro 训 az s-independent complex meigjhpoxrpood ofT4. 
771e co1syta11s so c,C depe1nd on Nd7y,Z ad ol pou1ndsy ofw, 7 0 o1 Ga co11- 
Pilexr mneigjhpoxrpaood of{ 姑 } x 4. 
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Prooj The proof uses again the ultraviolet cutofft argument of the proof of Lem- 
maX.2.1.This makes all the functions si ci;,770iyHi real-analyticinp for | 一 巡 | 么 
20 and of p in an es-independent complex neighbourhood of T4. The powers of 5 记 
the denominators of the estimates come from the presence of terms sj/00, Dj/00 
in Di(b, P) and Ai(b P) of (XI.2.11) and (XI2.12) and from Cauchy's estimates 
applied to sj), oj on | 一 巡 || < 20. 














XIIL3 Attractive Invariant Manifolds 


Theorems on invariant manifolds for maps have been proved many times 
for many different settings. The first results were obtained by Hadamard 
(1901) and Perron (1929). [.…] Our aim was to derive a global invariant 
manifold result with conditions that are easy to verify for the applications 
in mind. (K. Nipp 儿 D. Stoffer 1992) 


In this section we give results on the existence and properties of attractive invari- 
ant manifolds of maps, with a very explicit handling of constants. These results are 
due to Kirchgraber,，Lasagni， Nipp 氏 Stoffer (1991) and Nipp 多 Stoffer (1992). 
They will allow us to understand the weakly attractive closed curves that we ob- 
Served in Sect.XI.1. Beyond that particular example, these results are extremely 
Useful for studying the long-time behaviour of numerical discretizations in a great 
variety of applications; See Nipp 久 Stoffer (19935,，1996) and Lubich (2001) and 
references therein, and also Stuart & Humphries (1996) for a related invariant man- 
ifold theorem and its use in analyzing the dynamics of numerical integrators for 
non-Cconservative problems. 

Consider amap :人 xz 一 入 XYdefnedonthe Cartesian product of a 
Banach space X and a closed bounded subset Y of another Banach Space. We write 
SG(z,Vy) = (人 妨 with 


Z 十 jg 
= 92 切 . 
We assume that 厂 and 9 are Lipschitz bounded, with Lipschitz constants 了 zz 也 zy 
and Loyz, ZL with respectto 7, 0. 于 these Lipschitz constants are sufficiently small， 
then the map has an attractive invariant manifold. More precisely, there is the fol- 
lowing result stated without proof by Kirchgraber Lasagni, Nipp 多 Stoffer (1991) 
and proved in a more general setting by Nipp 久 Stoffer (1992). 


(3.1) 


局 


Theorem 3.1， 态 加 e apove Situatiom, 矿 


万 二 丽人 直 5 (G3.2) 


1jem 1jee eXis18 GJU1ctio1s :及 一 了 WPichnisZLIPDSscHPi Doxaded wii tpe co11sta11t 
入 三 271oz/(1 一 Daozr 一 工作 )， SucH tpat 


人 ={(zs(zZ)) :ZEX Divariantznader 5. 
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人 atlpacts orpitg ofE mw 1e attractimipy jaoctor pn = ALzy 十 Zi 广 工 1pati， 
从- s 全 | < py 二 sonolisjoral (及 E 生 x 


Pioof (a) We search for a function s: X 一 了 suchthat for (2 人 功 =@(zy,the 
relation y = s(Z) implies also 人 = s(2). For an arbitrary function oa : X 一 了 ， 
we first study which relation holds between 和 and 证 y = al(z). To write as 
a function of 人 we need a bjjective correspondence between Z and Z via the first 
equation of (3.1). By the Banach fxed-point theorem, the equation 


候 王 2Z 十 zac(z)) has aunique solution zz = we(2) 


for everyZE 和 让 2 jza(z)) is a contraction. This is the case 让 ce has the 
Lipschitz constant 入 and 
Lazz 十 Lazy 入 <1. (3.3) 


We then obtain 人 = 5(Z) from the following Scheme: 


一 : 


2 一 ur(D) 一 


|。 


三 Ca(Z) 


SS 


”人 92 

That is, we set = 5(Z) = qg(uo( Dalue(Z))). By construction, (2 人 三 (Z,). 
Under condition (3.3), the function wu : X 一 Xi Lipschitz bounded by / 三 
1/(1 一 Zazz 一 工 zy 和 ). Consequently, the functionG :X 一 了 isLipschitz bounded 
by (Ziz 十 工 mwA 和 )N. The condition that the transformed function C is again Lipschitz 
bounded by the same 和 as ao, therefore reads 


Zyz 十 也 yy 入 
人 .4 
0 人 


or edquivalently， 
Ta 一 (1 一 oz 一 Toy)A 十 Do 和 0. 


Under condition (3.2), there exists a non-empty real interval of values 和 Satisfying 
this quadratic inequality. In particular, (3.4) then holds for 
2 了 


(3.3) 


六 
TS 


(This is close to the Smallest possible value of 入 证 2VZayZoz < 匀 1 一 Laoz 一 站) 
It is easjly checked that (3.2) and (3.5) imply (3.3). 

Under conditions (3.3) and (3.4), the transformation 妃 : ac > 5, which is called 
a adazaza1d gs10D1 1ra1lsforl maps the set of functions 


93={c:X 一 了 | cisLipschitzbounded by 入 } 


into itself, lj.e.， 
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万 :9 一 9:0P 记 0: 


9 is a closed subset of C(X,Y), the Banach space of continuous functions from 
X to the bounded closed set 了 ,equipped with the supremum norm |lcll。 = 
Supzcx ||c(z)|. 下 五 ia contraction, then the Banach fxed-point theorem tells 
us that there is a unique function s E 9 with $ = 5. By construction，this 
means that 证 (了 信 = @(2,Vy) andy = s(zZ), then also 人 姓 = s(Z). The graph 
人 ={(zs(zZ)) :ZEXTisthen aninvariant manifold for the map 95. 

(b) We now Show that 囊 is already a contraction under condition (3.2). Let 
ao, al be two arbitrary functions in 9, and7Z e X. With zi = We; (了 )， 


|ci(2) 一 瑟 co(2)| = (ztoa(z)) 一 9(zooo(Zo)) 
和 gza'ci(z)) 一 gzioo(zZi)) 川 十 lgGzcayoo(zi)) 一 gzoao(Zo)) 





和 Zoollcr 一 colw< +T(oz 二 DoA za 一 zol . 


By deftinition,Z 王 Zr 十 jcioi(zZi)) fori 一 0,1.Subtracting these two equations 
yields similarly 


za 一 zol g | oatzi)) 一 Aczoao(zo))l 
和 | jcaoici)) 一 zco(zi)) 二 | Acayco(z)) 一 7zoao(Zo))l 
< Zazyllcl 一 coollw + (zzz 十 ZazyA)|zi 一 zol . 


Hence， 


志 
挟 2 
lm zolsI7 和 


Combining both inequalities and recalling (3.4), we obtain 





lei -coll。 . 


| 五 ci 二 五 aoll~ < (了 wy 十 和 ZLzy) oa ao|ll= 。 


Since the inequality 
了 1 十 ALay 反 工 (3.0) 


is satisfied by the 入 of (3.9) under condition (3.2), 互 is indeed a contraction. 


(c) Itremains to show that the invariant manifold AM is attractive. With (2 人 妇 
@(z,y)， we write 
8 sG) = 9g(2 切 一 sz+ Hg) 
= (ge 引 -g(es(O))+(sz+flcso))-sz+yflcg)) 
Here we used the identity 
s(z+ frs(o)))= az+ les(o))= gzs(z))， 


Which holds because 3 = 5 and by construction of the Hadamard transform. II 
follows that 





侈 一 52| 等 (Zasy 十 和 Zeo) ly 一 5， 
which together with (3.6) yields the result. 
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Next we Study the effect of a perturbation of the map on the invariant manifold. 


Theorem 3.2， Co1sider 11apy Go GO1 :和 X 了 一 XXXY Doorwjicpnsatish tpe 
CO1di1tio18 Oo 7Tjeore11 了 7 Wi 太 1pe Sa111e ZipSscHitz co11S1G1118 人 zz 了 zh Loz Loy 
ZLefl 5s0 a1d 5s1 pe 1je jazzctio1ny dejizaipng 1Pe attiracfive ivariant 1Ia111o1ds 人 410 记 
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lsi(z) 一 so(zZ)| < 二 7 Jor ZEX. 
(Bere 入 ad pda1e dejijzed as z0 7T1jeore111 了.71.) 
Poof The proof is similar to part (b) of the previous proof. Let 人 E X. For ; 三 
0,1,， we have si(Z) = 0i(z2isi(zi)) with 2i defined by the equation 到 一 2 十 
万 (2i si(Zi)). We estimate 
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< 也 zlsl 一 50||。 十 (Zr 十 工 Wy 入 |z1 一 20|| 
十 lgai(zo, so(zo)) 一 go(zo, so(Z0) 
and in the Same Way 

za 一 zol 和 | Atzasitzi)) 一 万 (zal)so(z1)) 
十 | Acayso(zi)) 一 万 (zo,so(Zo)) 
十 | Aizo,so(zo)) 一 大 4zo,so(zo)) 
们 了 zy|lsl 50|l- | (Zazz | 了 zy 入 ) |z1 一 20|| 
十 | Aikzo,so(zo)) 一 大 4zo,so(zZo)) 

















JInserting the Second bound into the first one and using (3.4) and the assumed bound 
on 901 一 So gives 


sl 一 soll。 < ww 十 AZLzo)llsl 一 so 十 0， 











which implies the result. 
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XIIL.4 Weakly Attractive Invariant Tori of Perturbed 
Jntegrable Systems 


We assume that the perturbation is dissipative such that one torus persists 
under the perturbation and gets attractive. 


Our analysis is done by the method of averaging. The problem of this sec- 
tion is classical, see e.g. Bogoliubov 女 Mitropolski (1961), Kirchgraber 
多 Stiefel (1978). (D. Stoffer 1998) 


In the example of the Van der Pol equation， we have Seen that only one of the peri- 
odic orblts of the harmonic oscillator persists under the Small nonlinear perturbation 
and becomes an attractive limit cycle. More generally, we consider perturbations of 
integrable Systems 

Q 王 Er7(a,0) 


(4.D 
0 = w(aj+sp(a,0) 


where (locally) just one invariant torus Survives the perturbation and attracts nearby 
Solutions. Using the results of the two previous Sections, it will be Shown that this 
Situation occurs 放 at some point a* where the frequencies wi(a”") are diophantine， 
the angular average F(a") is small and its Jacobian matrix 


4 一 7(o”) 


has all eigenvalues with negative real part. 

The following theorem is aslight modification of aresult of Stoffer (1998). Early 
verslions of it are much older; see the citations above. The origins of the problem can 
be traced back to the work of Van der Pol (1927) and 下 rylov 儿 Bogoliubov (1934). 

Here we assume the following: w(a”) satisfies the diophantine condition (X.2.4) 
With exponent 7. The perturbation functions r(a,09) and p(a,0) are real-analytic 
on a fxed complex neighbourhood of {a*} x TY and bounded independently of < 
(though they may depend on s). In some norm || . || on 陈 4 and its induced matrix 
norm, the bounds 


上 Fo 和 Cllog sl-20+9) (4.2) 
| 


| 入 


le et for tt>0 (4.3) 


hold with some constants C and a > 0. 


Theorem 4.1，Vmder 1Pe apove co11ditio11y, Jr SUHcienty S1G1L E > 0 1je yyste111 
(4.I) pas an inyarianattorsy 天 Wpicp altpack anO(log sl- 一 D-meigjpoxrpood oF 
{a*] x Td wiip an expomential rate proportional 1o <. 


Prooj The proof combines Lemma 2.1 and Theorem 3.1. For convenience We aS- 
sume o 一 0inthe folowing. Lemma 2.1 (with N 三 3) gives us a change of 
coordinates (oa 0) 上 (bp), O(E)-close to the identity, such that for | 和 6 with 
0 一 cllogsl-“ 汪 Lof (2.9)， 


XIIL53 Weakly Attractive Invariant Tori of Numerical Integrators 4065 


0 一 sna21(b) 十 s27702(b) 十 O(es3/162) 


(4.4) 
六 = ww 上 ea( 儿 上 es2jo( 四 十 OUe3152) . 


Since mi(b) = FU = 4 十 O(02) by (4.2), this system is of the form 
D = <40 十 O(s02) 
2 = w() 十 O(e) . 
Similarly, the corresponding variational equation ls of the form 
PN _ es4+O(e5) OUes/62)\ 1 
五 / O(1) @O(s3/62) / 
These relations and condition (4.3) imply that, for sufficiently small s and for any 
fixed 7 > 0, the time-7 flow of (4.1) maps the strip 万 = {(2p) : | < 寺 6,P e 


下 中 into itself, and the following bounds hold for the derivatives of the solution with 
Tespect to the initial values: 




















购 |=w-rem | 
(4.5) 
项 |s-o， | 二- 十 www 





Hence, for sufficiently Small =， 
Top 十 Ti 十 2VTebEoo<errso/2 <1. 


Theorem 3.1 (and Exercise 1) used with 2,b in the roles of z,Vy now Shows that the 
time-7 fow has an attractive invariant torus {(s(P),o) : P ETd}, where s:Td 一 
{ol < 寺 6} is Lipschitz bounded by 入 = 27oo/(1L 一 Deep 一 并 op) = O(e3/62). 
This invariant torus attracts orbits of the time-7 How map in the strip 忆 with the 
attractivity factor 和 72p 十 卫 吧 冬 e-7so/2. As Exercise 2 shows, the torus is actually 
invariant for the differential equation (4.1). 














XIILS Weakly Attractive Invariant TIori of Numerical 
Jntegrators 


Does the attractive invariant torus of Theorem 4.1 persistunder numerical discretiza- 
tion of the perturbed integrable System'? This question was first studied by Stoffer 
(1998) who worked directly with the discrete equations in his analysis. Here we take 
up the approach of Hairer 儿 Lubich (1999) where the problem was studied by com- 
bining backward error analysis and perturbation theory, Simijlar to what was done in 
the two preceding chapters. 
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XII.S.1 Modified Equations of Perturbed Differential Equations 


Below we need to use backward error analysis for the numerical solution of a per- 
turbed differential equation 


= jy)+Tsg(s)， 0) = 加 (5.1) 


with real-analytic functions 厂 and 9 and Small parameter s. We consider applying a 
one-step method Vi = 咯 (yo) of order p 之 1 with step size 几 > 0. The associated 
modified differential equations constructed in Chap.IX are then of the form 


= 万 念 二 ec8( 人 六 se) ， 了 0) = 加 (G.2) 


with suitably truncated series 


jO) = jg 种 记 1O 二 二 PTLwO 
Js) = 9g( 信 es) 十 12gp+i( 仿 s 十 十 ji 1gNw(e)， (5.3) 


Where the functions 方 are independent of <s, 几 , N，whereas the functions 9j are 
allowed to depend on s. The following adapts Theorem IX.7.6tothe above Situation . 


Theorem 5S.1. Zef /yy) 十 sg(y,Es) pe 7eal-amalytic (V ad =) ad Donded py MI 
Jory eE Bop(yo) and Jor all complexr swW 商 | < so. Zet the coefjcients or 1e 
TByior seriesy (太刀 of te 1taie1ical 11etpod pe aaa1ytic 太 BR(yo) wii porxnds 
(2X73) Jorlsl 和 so. 7T1em tpere exisg Po > 0 (proportional io 民 ][I) sucj tpat 
Jor 关 和 jpo/4andjporN = NO tpe largestinteger miihAN<jo tpe djerence 
Defwee1 te Jaierical solufion 册 三 芋 E(V0) and 加 e exactsolufion 08 (yo0) of te 
17r1cated 111odijied equatio1l (5.2)-(5.3) Satisjres 


8(o) 一 共 (o) < Che 人， 


TJe junctions 六 and Torf5.3) are reaLanalytic 六 R(yo) wi 态 


|O 一 天 OCR 5 有 一 9 信和 CN 
JoryE BRp/a2(yo) aad | < so0. 7T1pe comstants C ae independent ormu <jo/4and 


ls| < so0. 


Proojf The exponentially small estimate for 有 5(yo) 一 吃 和 A(yo) is that of Theo- 


Iem IX.7.6 applied to the differential equation (5.1). The O(M2P) bound for jy) 一 
jy) is the estimate (IX.7.14) applied to 7 = JJ. By applying that estimate to 
(5.1D), abound of the same type is obtained for ( 放 J) + sz se) 一 (FJ 上 +sg( 人 )， 
uniformly for all complex s in the complex disk |s| 和 so. For any fixed y E 


刀 Ry/z(yoj, the difference 


je-oa=a(GgTee-Gg+eoel 玫 -7 


is an analytic function of s in the complex disk |s| < so, which is bounded by O(AZ) 
for |s| = so. By the maximum Principle, the same bound then holds for |s| < so. 
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XII.S.2 Symplectic Methods 


We apply a symplectic integrator with step Size 玫 to areal-analytic perturbed inte- 
grable Hamiltonian system in coordinates (D, 9)， 


0 万 





六 = 一 训 人 9 二 eg) 
和 
(5.4) 
= 页 吕 人 +etp,g)， 
P 


We asSsume that the unperturbed System (E 三 0) ls a completely integrable SYS- 
tem which satisfies the conditions ofthe Arnold-Liouville theorem, Theorem X.1.0. 
Hence, there exists a transformation to action-angle variables for the 


ifeg1apje System (D,dq) mm (oa 0) by Theorem X.1.6. 


This change of coordinates transforms the integrable System to the equations w = 0， 
0 = w(a)j,andit transforms (3$.4) to a system (4.1), for which we assume (4.2), (4.3) 
and the diophantine condition (X.2.4) with exponent Z for w(a"). The following 
theorem is a _ variant of results in Stoffer (1998) and Hairer 儿 Lublich (1999). 工 
Shows that for Symplectic methods, the invariant torus perslsts under a very mild 
Testriction on the Step Size. For non-Symplectic methods, this would require Step 
Sizes 几 with 1 < 祥 s (See Exercise 9). 


Theorem 5S$.2. ZLetl a sy11Dlectic AU1ierical integrator oforderDp pe applied io a Per 
1U1ped ;nt1eg1aDie 有 aazlio1lia1 8SySte1a (3.4) WAich Satis1jies 1pe co1aditio11s Staled 
Gpoyve. 71e11,， 1pere exisf so > 0 ad co > 08SuUcn tt Jor0<= 芝 50da1djor 
SteD Sizey 忆 > 05SalisAzg 

AP < collogs| (5.9) 


Wi A 三 max(z 十 Q 十 1)D)， te Pierical 1metpod jas a1 attractive iva7ia1at 
io 天 六 7TJig iorszO(P) close io tpe ivariantiorus or. 逢 .下 attracts a1 
GO(log sl- 笃 ) neigpbporxrpood wiip an expomenfial rale proportional io ec Untorapy 
态 几 . 


Remark S.3. The exponent 十 d 十 1 comes from Lemma X.4.1. It could bereduced 
to 十 1by using Russmann's estimates in place of that lemma; cf. the remark after 
Lemma .4.1. 


Proofof7jpeore1z 5.2. The proof combines backward error analysis (Theorem 
IX.3.1 and Theorem 5.1)，perturbation theory (Theorem X.4.4 and Lemma 2.1)， 
and the invariant manifold theorem (Theorem 3.1). 

(a) We begin by considering the Symplectic method applied to the integrable 
了 amiltonian System (3.4) with s = 0. This leads us back to the questions of Chap. 久 . 
We use backward error analysis and recall (Theorem IX.3.1) that the modified equa- 
tion is again Hamiltonian and an O(A2) perturbation of the integrable System, both 
in the (2,9) and the (o, 0) variables. We transform variables for the 
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1iOdijiea equation of tjpe integraple syste11 (a;,0) 一 (0 0) by Theorem 和 .4.4， 


with A2 in the role of the perturbation parameter. By (X.4.1) with N proportional to 
|log sl, and by condition (5.5) with a sufficiently small co, the modified equations 
in these variables become 


= Of(c3) 


匡 for | 人 阐 一 ol < cllogel- 关 ， 
w(o) 十 O(z3) 


0 


with 六 (可 = w( 有 四 二 O(MP). Moreover the transformation (a,g) > (也 骨 is 
O(P2P) close to the identity. 

(b) The modified equations of the perturbed system,， written in the (@W 何 Vari- 
ables, become 


羡 = ce 全 骨 十 Ofe3) 


for ll--ox* < crllogsl- 和， (0G.0) 
0 = (+esjz0)+O(c3) 


where 六 0) = 7(20) 二 OU) and 六 200) = p( 世 角 十 O(iP) by Theorem 5.1. 
Consider now these equations with the O(s3) terms dropped. We change variables 
for the 


1iodijiead eguation of te perturped System (0 0) 上 (bO) by Lemma 2.1. 


(Note Exercise 4 with w(o”) = w(o) 二 O(P) and (5$.5).) The System (3.6) is 
transformed to the form of (4.4)， 


0 = c 记 (人 十 O(e3/162) 


广 = ( 人 十 sz 十 O(e3/152) 


(3.7) 


with g = cr*|logel-2x,and where 遍 ( 人 一 珊 人 十 O(e/16) = 7 人 十 OU) 二 OUe/10)， 
and also the Jacobian of 刺 at ax is close to that of F, so that it satisfies again (4.3)， 
at least with a replaced by a/2. In the same way as in the proof of Theorem 4.1 and 
with the Same Lipschitz constants as in (4.3), we now obtain an attractive invariant 
torus of the modiftied equation of the perturbed system. The time- 岂 fow of this 
equation is an exponentially small (in 1/P) Lipschitz perturbation of the numerical 
one-step map, so that under condition ($.3) it is an O(s3) perturbation. Therefore， 
Theorem 3.1 yields an invariant torus 7T< hn of the numerical method. 

(c) It remains to bound the distance between the tori T<) and 7T<. We recall that 
< was obtained by a transformation of the 


Perturped syste11 (a0) 一 (bpP) by Lemma 2.1， 


which puts (4.1) into the form (4.4). We thus have the transformations 
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着 | 


一 和 
(0 一 


Where the Symbols AM and s indicate that the transformation is O(P2P) or O(s) close 
to the identity. By the construction of Lemma 2.1, the composed transformation 
(六 p) > (已 加 isO(iP) close to the identity and moreover the right-hand sides of 
(4.4) and (5.7) differ by O(sj2). Theorem 3.2 (with p 三 e-s7o/2) now Shows that 
the functions sc< 六 and se deftining Zn and Te, respectively, differ by O(AZP). This 


yields the desired distance bound. 














XII.S.3 Symmetric Methods 


Aresult analogous to the theorem of the previous Subsection holds for reversible 
methods applied to perturbed reversible Systems 


勾 一 juu) 十 sK(V) 
站 三 9g(uu) 十 su VD) 





where the unperturbed System (E = 0) ls areal-analytic integrable reversible System. 
于 the perturbed System, written in action-angle variables of the unperturbed System， 
Satisfies the conditions of Theorem 4.1, then a reversible analogue of Theorem 5.2 
holds, where the terms “symplectic” and “Hamiltonian” are Simply replaced by “re- 
versible"”. The proof remains the Same, working with the reversible analogues of the 
results used for the Hamiltonian case. 


XII.6 玉 xXercises 


1. In the Situation of the invariant manifold theorem，Theorem 3.1，Suppose im 
addition that /and 9 are a-periodic in z: jz 十 ay) = jz 9 二 ay) 三 
g(z,y) foralzeX,yEY. Show thatin this case the function s defining the 
invariant manifold is also w-periodic. 

ZL The Hadamard transform maps aw-periodic functions to aw-periodic func- 
tions， 

2. Show that it the time-7 fow map 瑟 = pr of adifferential equation has an 
attractive invariant manifold 人 4, and ifthe flow wp maps a domain of attractivity 
of A underg into itself for every real t then 人 is also invariant under the fow 
oOt for every real 志 
BILE Write of 王 2 optog andusetheattractivity of 人 for 7 一 co. 
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Prove that in the situation of Theorem 3.1, iterates (Zn+l,yn+i) = (znyn) 
have the property of asy11tptotic Pjase (Nipp 色 Stoffer 1992): there exists a Se- 
quence (Zn 各) of iterates on the invariant manifold, i.e., with (Zn+1,Vyn+1) 一 
SB(zZn 加 ) and 人 =s(zn),such thatforal72 >0， 


2 一 和 | < cl 一 szn)l 
| 一 名 SGL+ 和 cllyn 一 szo) 儿 ， 


where c = X/(1L-- AX) with 入 = 27oz/(L 一 Lazz 一 了 my) of (3.3) and 
入 ”二 27ay/( 一 了 zz 一 了 yy). Note that | 一 szm)j| 和 "lyo 一 szo)l 
by Theorem 3.1. 

碳 iPL Consider the sequences ( 寂 昌 ， 隐 人 ) defined by 区 ) 三 4 太 其 人 一 5( ZN) 
and ( 宕 全 基 人 ) 一 画 (人 区) form 一 天 一 1 10. Show that for 


fixed 7, the sequence (zz 人) (让 之 mn) is a Cauchy sequence. 


. Show that Lemma 2.1 holds unchanged iftthe diophantine condition (和 .2.4) for 


w(ar) is weakenedto w(a*) = wx* 二 O(02) with w* satisfying (X.2.4). 


. In the Situation of Theorem 5.2, Show that every numerical integrator of order 7P 


has an attractive invariant torus 让 2 < s. This torus is O(A2P/s) close to the 
invariant torus of the continuous System. 


Chapter XIIIL. 
Oscillatory Differential 上 Equations 
with Constant High Frequencies 


This chapter deals with numerical methods for second-order differential equations 
with oscillatory solutions. These methods are designed to require a new complete 
function evaluation only after a time step over one or many periods of the fastest oS- 
cillations in the System. Various Such methods have been proposed in the literature 一 
Some of them decades ago, Some very recently, motivated by problems from mole- 
cular dynamilcs, astrophysics and nonlinear wave equations. For these methods it ls 
not obvious what implications geometric properties like Symplecticity or reverSsibjil- 
ity have on the long-time behaviour， e.g.,， on energy conservation. The backward 
error analysis of Chap.IX,， which was the backbone of the results of the three pre- 
ceding chapters, 1s no longer applicable when the product of the step Size with the 
highest frequency is not Small, which is the Situation of interest here. The“exponen- 
tially small” remainder terms are now only O(1)! For differential equations where 
the high frequencies of the oscillations remain nearly constant along the solution， 
a Substitute for the backward error analysis of Chap.IX is given by the 11odxialed 
Zouxyier expa118io11y of the exact and the numerical solutions. Among other proper- 
ties, they permit us to understand the numerical long-time conservation of the total 
and oscillatory enersgies (orthe failure of conserving energy in certain cases). Itturns 
out, Symmetry of the methods is still essential, but Symplecticity plays no role in the 
analysis and in the numerical experiments,，and new conditions of an apparently 
non-geometric nature come into play. 


XIIU.1 Iowards Longer Time Steps in solving 
Oscillatory Equations of Motion 


Dynamical Systems with multiple time scales pose a major problem in 
Simulations because the small time steps required for stable integration 
of the fast motions lead to large numbers of time steps required for the 
observation of slow degrees of freedom and thus to the need to compute 
alarge number of forces. 

(M. Tuckerman, B.J. Berne 儿 G.J. Martyna 1992) 


We describe numerical methods that have been proposed for solving highly os- 
cillatory Second-order differential equations with fewer force evaluations than are 
needed by standard integrators like the Stormer-VYerlet method. We present the ideas 
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underlying the construction of the methods and leave numerical comparisons to 
Sect.XIIH.2 and the analysis of the methods to Sections XII.3-XII.06. We conslider 
only methods that are Symmetric or Symplectic. The presentation in this Section 
follows roughly the chronological order. 


XII.1.1 The Stormer-Verlet Method vs. Multiple Time Scales 


Perhaps the most widely used method of integrating the equations of mo- 
tion is that initially adopted by Verlet (1967) and attributed to Stormer. 
(M.P Allen 多 D.J.Tildesley 1987, p.78) 


The Newtonian equations of motion of particle Systems (in molecular dynamics， 
astrophysics and elsewhere) are Second-order differential equations 


= -VV(g. (GD 


To Simplify the presentation， we omit the positive definite mass matrix AM which 
would usually multiply 4. This entails no loss of generality, since a transformation 
d 一 MI12g andV(d) 一 了 (MT-LH29) gives the very form (1.1)， 

The standard numerical integrator of molecular dynamics is the Stormer-Verlet 
Scheme; See Chap.I. We recall that this method computes the new positions qn+1 at 
time t 妃 +1 from 

on 一 2 十 ol = 用 记 (1.2) 


with the force 万 = --VV(dan). Velocity approximations are given by 


dmn+l 一 dm 一 1 


In its one-step formulation (see (L.1.17)) the method readsl 


Dr+1/2 二 Din 工 3 九 记 
dmn+l 二 dm 下 RDn+172 (1.3) 
Pn+1l 三 Prn+1/2 十 双 廊 +1 








We recall that this is a Symmetric and Symplectic method of order 2. For linear 
Stability, ij.e., for bounded error propagation in linearized equations, the Step Silze 
must be restricted to 

APw <2 


where w jls the largest eligenfrequency (le.，Square root of an eligenvalue) of the 
Hessian matrix V2V(dg) along the numerical solution; see Sect.I.5.1. Good energy 
conservation Tequires an even Stronger restriction on the step Size. Values of Pw 祥 到 
are frequently used in molecular dynamics simulations. 

The potential Y(d) is often a sum of potentials that act on different time Scales， 


1 We write p when the Hamiltonian structure and symplecticity are an issue, and d otherwise. 
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VY(q) = 三 丈 (q)+VG with V2TY7(q) positive semi-definite and 


1.4 
lvarr(gll > lvzr(ol， 9 


In this situation, solutions are in general highly oscillatory on the Slow time Scale 
rw~L/|var(OlP/2， 

In particular when the Jest forces -VI(d) are cheaper to evaluate than the 
Slow forces -- VV(qj,itis of interest to devise methods where the required number 
of Slow-force evaluations ls not (or not Severely) affected by the presence of the 
fast forces which are responsible for the oscillatory behaviour and which restrict 
the Step Size of standard integrators like the Stormer-Verlet Scheme. This Situation 
occurs in molecular dynamics，where 作 (9) corresponds to short-range molecular 
bonds, whereas UV(a) includes inter alia long-range electrostatic potentials. 

In Some approaches to this computational problem, the differential model is 
modified: highly oscillatory components are replaced by constraints (Ryckaert, Cic- 
cotti 儿 Berendsen 1977), or Stochastic and dissipative terms are added to the model 
(See 9Schlick 1999). Such modifications may prove highly Successful in Some appli- 
cations. In the following, however, we restrict our attention to methods which aim 
at long time steps directly for the problem (1.1) with (1.4). 

Spatial semi-discretizations of nonlinear wave equations，Such as the Sine- 
Gordon equation 

2tt 一 Wzz 一 Si ， 


form another important class of equations (1.1) with (1.4). Here 作 (9) 三 97 49， 
where 4 is the discretization matrix of the differential operator 一 9210z2. 


XIIHU.1.2 Gautschi's and Deufhardq's Trigonometric Methods 


It is anticipated that trigonometric methods can be applied，with simi- 
lar Success, also to nonlinear differential equations describing oscillation 
phenomena. (W. Gautschi 1961) 


The oldest methods allowing the use of long time steps in oscillatory problems con- 
cern the particular case of a quadratic potential 全 (9) 三 3w2d7q with w 祖 1, for 
which the equations take the form 


4 = 一 w2d 二 9g(q) . (1.5) 


For such equations，Gautschi (1961) proposed a number of methods of multistep 
type which are constructed to be exact 让 the Solution is a trigonometric polynomial 
inwtof a prescribed degree. The Simplest of these methods (and the only Symmetric 
one) reads 

danH 一 2 十 gl 一 jsinc( 寺 pw) 如 ， (1.6) 


where sincE = sinE/c and 如 = -ww20 十 gn with gn = 9g(anj, orequivalently 


qdnH1 一 2cos(huo)on 十 dm 1 三 7 sinc2( 寺 pw) 人 (1.7) 
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The method gives the exact solution for equations (1.3) with 9 = Comst and arbi- 
trary w (See also Hersch (1938) for such a construction Principle). This property is 
readily verifed with the variation-of-constants formula 


人 
(jeaaja 


This formula also shows that the following Scheme for a velocity approximation 
becomes exact for 9g 王 Co7st 


0 一 01=27sinc(ju) 如 ， (1.9) 


Starting values q1l and di are also obtained from (1.8) with q(qo) in place of q(q(s)). 

Deuflhard (1979) considered /2-extrapolation based on the explicit symmetric 
method that is obtained by replacing the integral term in (1.8) by its trapezoidal rule 
approximation: 


an CogS hv Sinc Acww qn 上 妃 ? Sinc(Aw) gn 
jn \ 一 puosin jp cos jw POn 2 \gn+rli+cos(po)ogn 太 
(L10) 


Eliminating the velocities yields the two-step formulation 
qdnH1 一 2cos(hwjan 十 dn_1 一 jsinc(Ppo)ogn . (1.11) 
The velocity approximation is obtained back from 
21sinc(hw)dn 一 qn+l 一 an 1 (1.12) 
or alternatively from 


gm 二 1 一 gm 一 1 
过 攻 人 交 人 
2 


Both Gautschi's and Deufhard's method reduce to the Stormer-Verlet Scheme for 
w 一 0.Both methods extend in a Straightforward way to Systemas 


= 一 4dg+9g(g) (1.13) 


with a symmetric positive semi-definite matrix 4，by formally replacing w by 
= 41/2 in the above formulas. The methods then require the computation of 
products of entire functions of the matrix 刀 2 4 with vectors. This can be done by 
diagonalizing 4，which is effcient for problems of small dimension or in Spec- 
tral methods for nonlinear wave equations. In high-dimensional problems where 
a diagonalization is not feasible, these matrix function times Vector products can 
be effticiently computed by Superlinearly convergent Krylov Subspace methods, see 
Druskin 馆 Knizhnerman (1995) and Hochbruck 氏 Lublich (1997). 

The above methods permit extensions to more general problems (1.1) with (1.4)， 
but this requires a reinterpretation to which we turn next. 


dmn+1 四 2cos(jhw)on 十 dan 1 
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XIIHU.1.3 The Impulse Method 


JIntegrators based on T-RESPA [.…] have led to considerable speed-up in 
the CPU time for large scale simulations of biomacromolecular Solutions. 
Since IT-RESPA is symplectic such integrators are very Stable. 

(B.J. Berne 1999) 


The Stormer-Verlet method (1.3) can be interpreted as approximating the fow 这 
of the system with Hamiltonian 万 (2,9) = 了 (Oo) 十 V(9) withT(p) = 57p by the 
Symmetric Splitting 
2jj2 9 9 op1ja ， 

which involves only the flows of the systems with Hamiltonians T (DZ) and 了 (9q)， 
which are trivial to compute; See Sect. I.5. 

In the Situation (1.4) of a potential Y 三 作 十 避 , wemay instead use a different 
splitting of = (下 十 厂 ) 十 Vand approximate the fow (5 of the System by 


过 T+ 
IPA/2 9 7 9 PP/2 ， 


This gives a method that was proposed in the context of molecular dynamics by 
Grubmitller Heller Windemuth 儿 Schulten (1991) (their Yerlet-I Scheme) and by 
Tuckerman, Berne & Martyna (1992) (their T-RESPA scheme). Following the termi- 
nology of Garcia-Archilla, Sanz-Serna 多 Skeel (1999) we here refer to this method 
as the 17IDULSe 111etpod: 


1. kick: set ?机 = pn 一 二 VU(dn) 
2. oscillate: solved = --V(dq) with initial values (qu,2 二 ) 


over atime step 由 to obtain (an DnH1) 4.14) 


3. kick: set pn+1 三 DT 一 3VU(dn+l) 


Step 2 must in general be computed approximately by a numerical integrator with 
a Smaller time step，which results in the multiple time stepping method that we 
encountered in Sect. VIIU.4. 过 the inner integrator is Symplectic and Symmetric, as it 
would be for the natural cholice of the Stormer-Verlet method, then also the overall 
method is Symplectic - as a composition of symplectic transformations, and it ls 
Symmetric - as a Symmetric composition of Symmetric stepS. 

It is interesting to note that the impulse method (with exact Solution of step 2) 
reduces to Deufhard's method in the case of a quadratic potential f 太 (d) 三 397 .44 
(Exercise 1). 

Though the method does allow larger step Sizes than the Stormer-Verlet method 
in molecular dynamics Simulations, it is not free from numerical difficulties. Bliesa- 
decki 久 Skeel (1993) and Garcfa-Archilla et al. (1999) report and in linear model 
problems analyze instabilities and numerical rfesonance phenomena when the prod- 
uct of the step size 九 with an eigenfrequency w of V2T is near an integral multiple 
of 克 . 
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XIIU.1.4 The Molliftied Impulse Method 


We also propose a nontrivial improvement of the impulse method that 
we call the 12o11iead ;pulse etpod, for which superior stability and 
accuracy ls demonstrated. 

(B.Garcfia-Archilla, J.M. Sanz-Serna 氏 R.D. Skeel 1999) 


Diftculties with the impulse method can be intuitively Seen to come from two 
Sources: the slow force --VU(9) has an effect only at the ends of a time step, but if 
does not enter into the oscillations in betweeni the Slow force ls evaluated, Somewhat 
arbitrarily, at isolated points of the oscillatory solution. 

Garcfa-Archilla et al. (1999) propose to evaluate the Slow force at an ave1aged 
value = a(an). They replace the potential V(d) by U(a) = UV(a(da)) and hence 
the Slow force -VUV(dq) in the impulse method by the mo1lijied Jorce 


-VT(g = -ao(0O7YU(a(o) (1.15) 


Since this 1OVLi1jed zDULse 1ietjpod is the impulse method for a modified potential， 
it ls again Symplectic and Symmetric. 

There are numerous posSibjlities to choose the average a(dqn), but care should be 
taken that it is only afunction of the position gn and thus independent of pn, in order 
to obtain a symplectic and Symmetric method. This precludes taking averages of the 
Solution of the problem in the oscillation Step (Step 2) of the algorithm. Instead, one 
Solves the auxiliary initial value problem 


站 = -VIF(z) with z(0) =q 2z(0)=0 (1.16) 


together with the variational equation (using the Same method and the Same step 
SiZe) _ 
X =-V2T(z)X with X(0) = X(0)=0 (1.17) 


and computes the time average over an interval of length cn for some c > 0: 


1 C 凡 1 C 户 
a(9) 一 1 zZ( 攻 dt au 人 (do) = 9 Xiat. (1.18) 


Garcfa-Archilla et al. (1999) found that the choice c = 1 gives the best results. 
Weighted averages instead of the Simple average used above give no Improvement. 

Izaguirre, Reich 儿 Skeel (1999) propose to take au(d) as a projection of g to the 
manifold Vf(9) = 0 of rest positions of the fast forces, for situations where all 
non-Zero eigenfrequencies of V2TI(o) are much larger than those of V2U(d). This 
choice ls motivated by the fact that solutions oscillate about this manifold. 

We now turn to the interesting Special case of a quadratic 厂 (9) = 39749 with 
asymmetric positive semi-definite matrix 4. In this case, the above average can be 


computed analytically. It becomes 


a(9q) 一 02)9 
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with 2 = 412 and the function %(E) = sinc(c6). For a(q) defined by the orthogo- 
nal projection to 4d = 0 we have %(0) = 1 and %(5) = 0 forec away from 0. With 
on 三 一 0OIVU(OO)qn),the molliftied impulse method reduces to 


7 三 _Dn 十 gr 





dan+L1 cosP2 玫 Sinc 肠 12 国 (19) 
DT  N-2snnh2 cos 2 
Drn+1 一 DA 十 Pgn+l ， 


This can equivalently be written as (1.10) with the same 9qn (and fY2 in place of ww)， 
or in the two-step form (1.11) with (1.12). 


XIIH.1.S Gautschis Method Revisited 


We recall that Gautschi's method (1.7) (with 2 = 41/2 ip place of w) integrates 
equations 4 = 一 4q 十 q(d) exactly in the case of a constant inhomogeneity 9(d) 三 
Co1st. This property is obviously kept if the argument of 9 in the algorithm is 
modified to 
gm 三 9gOUO2)dqn) 
Similar to the previous Subsection. Such Gautschi-type methods were analyzed by 
Hochbruck 久 Lubich (1999a). Functions % with %(0) = 1 that vanish at integral 
multiples of rr give a Substantial Improvement over the original Gautschi method. 
The choice 
%(6) = sincE(l 十 地 sin2 356) (1.20) 
was found to give particularly good accuracy. The methods are Symmetric but not 
Symplectic. 
The following symmetric method for general problems (1.1) with (1.4) was pro- 
posed by Hochbruck & Lubich (1999a). The method reduces to Gautschi-type meth- 


ods for quadratic 人 (9) 三 二 744. Given dg and dh，one computes an averaged 
value 0 = a(an) and the solution of 
忆 三 -V 玉 (一 VC) with (0) =an， (0) = (1.21) 


backwards and forwards on the intervals from 0 to 一 凡 and 0 to 凡 . Note that this 
redquires only one evaluation of the Slow force 一 VLU. Then,， an+l and w+ are 
computed from 

dm 十 1 攻 水 各 -1 二 后 1 2 十 以 一 几 ) (1.22) 

HE 一 各 -1 = 2 人 JP) 一 信 一 站， 

When the differential equation for w is Solved approximately by a Symmetric nu- 
merical method with smaller time steps, then this becomes a Symmetric multiple 
time-stepping method. For the interpretation as an averaged-force method and for 
the corresponding one-step verslion, where the initial value for the velocity in (1.21) 
js replaced by (0) = 0, we refer back to Sect.VJII.4 (where qn instead of the 
average TI, 一 Qa(dn) was taken as the argument of the slow force 一 VD). 
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XIIH.1.6 Two-Force Methods 


Hairer 儿 Lubich (2000a) compare the analytical solution and the numerical Solu- 
tions given by the above methods in the Fermi-Pasta-Ulam model of 9%ect.I.3.1， 
using the tool of modulated Fourier expanslons (See 9ections XIII.3 and XII.3 be- 
low). Their analysis of the Slow energy exchange between Stiff Springs leads them 
to propose the following method for equations4 = 一 4 十 9(q), which requires two 
evaluations of the slow force per time step: with f = 41/2, set 


dnf 一 2cos(PhD)an 二 dl = 忆 sinc(PO2)9g(o) 十 jd (1.23) 

with 
du = sinc2(PPD) g(qn) -sinc(2)9g(sinc(P2)an) . (1.24) 
This method gives the correct Slow energy exchange between Stiff components in 
the model problem and has better energy conservation than the Deufhard/impulse 
method. With the velocity approximation (1.12) the method can equivalently be 
wiitten in the one-step forms (1.19) or (1.10). The method extends again to a Sym- 


metric method for general problems (1.1) with (1.4), giving a correction to the im- 
pulse method: let q(9) = -VV(q) andlet ua(q) be defined by (1.18) with c = 1. Set 


2 = 一 a(dn) and 

本 2 

7(w) = 雹 (a(w 十 二 2g(qn)) 一 adn)) ， 
The method then consists of taking 


gn = 9g(qn) 十 了 (qdn) 一 9(2n) 


instead of g(o) = 一 VV(an) in the impulse method (1.14). 
Atwo-force method with interesting properties, for Situations where all non-zero 
eigenfrequencies of 4 are much largerthan those of V2U(dqj,is given by (1.23) with 


du = sinc2(3AD2)9g(X(PO2)an) 一 sinc(PD) 9(X(PD)an) ， (1.25) 


where X(0) =1and X(6) =0for 和 away from 0. 


XIU.2 A Nonlinear Model Problem and Numerical 
Phenomena 


To gain inslight into the properties of the various numerical methods described in 
the previous Section, it 1 helpful to study the methods when they are applied to 
Suitably chosen, rather Simple model problems which show characteristic features 
but are still accessible to an analysis. Such an approach has traditionally been very 
Successful for stifft differential equations (See, e.g., Hairer 贸 Wanner 1990). For the 
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present stiff-oscillatory case we investigate the behaviour of the numerical methods 
on nonlinear Systems 


这 十 27 = 9g(Z) (2.1) 
with a Smooth gradient nonlinearity 9(Z) = --VU(z) and with the square matrix 
0 0 
ae 上 由 六 1 工 ， (2.2) 


with blocks of arbitrary dimension. We consider only solutions whose energy js 
bounded independently of w, So that in particular the initial values Satisfy 


直 2(0) 上 十 引 2z(0) 上 2 < (2.3) 


with 巴 Independent of w. 

The Fermi-Pasta-Ulam (FEPU) problem of Sect.I.5.1 belongs precisely to this 
class, and we will present numerical experiments with this example. In the model 
problem (2.1) with (2.2) we clearly Impose Strong restrictions in that the high fre- 
quencies are confined to the linear part and that there ls a Single, constant high fre- 
quency. The extension to Several high frequencies will be given in Sect. XIII.9, and 
constant-frequency Systems with a position-dependent kinetic energy term are con- 
Sidered in Sect.XII.10. Oscillatory Systems with time- or Solution-dependent high 
frequencies will be studied,， with different techniques and for different numerical 
methods, in Chap. XIV. 

In any case, satisfactory behaviour of a method on the model problem (2.1) can 
be anticipated to be necesSsary for a successful treatment of more general Situations. 


XIII2.1 Time Scales in the Fermi--Pasta-Ulam Problem 


The FPU model shows different behaviour on different time Scales: almost-harmonic 
motion of the stiff springs on the time scale w-1, motion of the soft springs on the 
Scale w0, energy exchange between stiff springs on the time scale w, and almost- 
preservation of the oscillatory energy over intervals that are exponentially long in 
w. This ls ilustrated in the following. 

We conslder the FPU problem with three stifft Springs with the data of Sect. I.3.1. 
The four pictures of Fig.2.1 show the evolution of the following quantities: the total 
energy 


瓦 (z,2) 一 32 人 二 32702277 十 U(z)， (2.4) 


(or rather 妃 一 0.8 for graphical reasons), which is a conserved quantity; the oscil- 
latory energy 


了 一 五 十 瑟 十 再 with 万 = 到 和 十 于 wz， (2.5) 





where Zlj is the Jth component of the lower half zl E 有 3 of z 一 (zo,zZ1)7 e 了 及， 
decomposed according to the blocks of f2 in (2.2). We recall that Z1,j Tepresents the 


480 XII. Oscillatory Differential Equations with Constant High Frequencies 



































项 
四 风 
04 08 1 2 
玖 上 | 六 =2/w 万 
1 了 
[ 

























仙人 遇 


3 届 JANE 
50 100 150 2500 5000 7500 

















Fig. 2.1. Different time scales in the Fermi-Pasta-Ulam model (w = 50) 


elongation of the 7th stifft Spring. Further quantities Shown are the kinetic energy of 

the mass centre motion and of the relative motion of masses joined by a stiff Spring， 
三 过 zz 上， 于 三 记 吉 | 

Time Scale w-!. The vibration of the stiff linear springs is nearly harmonic with 

almost-period rr/w. This is illustrated by the plot of 嫉 in the first picture， 


Time Scale w0. This is the time scale of the motion of the soft nonlinear springs, as 
is exemplified by the plot of 7o in the second picture of Fig. 2.1. 


Time Scale w. A slow energy exchange among the stiff Springs takes place on the 
Scale w. In the third picture, the initially excited first Stifft Spring passes energy to 
the second one, and then also the third stiff Spring begins to vibrate. The picture 
also illustrates that the problem is very Sensjltive to perturpations of the initial data: 
the grey curves of each of 万 , 有 P, Fa correspond to initial data where 10-5 has been 
added to zZzo1(0), zol1(0) and 211(0). The displayed solutions of the first three 
pictures have been computed very accurately by an adaptive integrator. 


Time Scale w>, NV > 2. The oscillatory energy 了 has only O(w-1) deviations from 
the initial value over very long time intervals. The fourth Picture of Fig.2.1 Shows 
the total energy 如 and the oscillatory energy 7 as computed by method (1.10)-(1.11) 
of Sect.XIII.1.2 with the step size 玫 = 2/w, which is nearly as large as the length 
of the time interval of the first picture. No drift is seen for 如 or 7 


XII.2 A Nonlinear Model Problem and Numerical Phenomena 481 


XIIH.2.2 Numerical Methods 


The methods described in 9ect.XIIU.1 all have in common that they reduce to the 
Stormer-Verlet method when they are applied to (2.1) with 942 = 0, and they become 
exact Solvers for the linear homogeneous problem with 9g(z) 三 0. They can be 
formulated as one-step or two-step Schemes. 


Two-Step Formulation. All the methods of Sections XII.1.2-XIHI.1.3, when ap- 
plied to the System (2.1), can be written in the two-step form 


Zn 一 2cos(PPDJzn 十 Zn 1 = 77q(Gzn) (2.0) 


Here 允 = WPO2) and 瑟 = bj2) where the Jilier jnctfions W and bare even， 
real-valued functions with V(0) = %(0) = 1. mm our numerical experiments we will 
consider the following choices of V and 四, where again sinc() = sin /5: 


(4) (6 = sinC(36) 0(E) 一 Gautschi (1961) 

(B) VC = sinc(g) %(6) = Deuflthard (1979) 

(C) Up 人 6) = sinc() 0 06) = Re ) Garcfa-Archilla 儿 al. (1999) 
( 姜 ) (6) = sinC (36) 09(E) of 人 20) “Hochbruck & Lubich (1999a) 
(五 ) (6) 三 Sinc2 ( 绳 (6) = Hairer & Lubich (2000a) 


One-Step Formulation. The method (2.6) can be written as a Symmetric one-step 
method of a form that is motivated by the variation-of-constants formula (1.8). This 
now also includes a velocity approximation Zn: 


Zn 一 cosphPDzn 十 DTsinh22n 十 下 Ngn (2.7) 
2nH1 三 一 snhPDzn 二 cosnPD2zn 十 坟 (Zoogn 十 五 gm (2.9) 





Where gn 三 g(Gzn) and 四 = Vo(O 矶 = (2) with even functions Wo， 
Wi satisfying Wo(0) = 1 Wi(0) = 1. Exchanging7 人 光 十 1and 几 盆 一 Ainthe 
method, it is seen that the method is Symmetric 让 and only 让 


(6) = sinc(6) 加 (9， Wo(9 =cos(9 凡 (9 . (2.9) 
The method is then Symplectic if and only 让 (Exercise 2) 
(5S) = sinc(6) 0(6) . (2.10) 


Two-Step Velocity Schemes. For a symmetric method (2.7)-(2.8) the velocity ap- 
proximation can be equivalently obtained from 


21sinc(hw)2Zn 一 Znl 一 Zn 1 (2.11) 
(for sin(Pw) 夭 0) or from 
2rH1 一 2cos(PD) 吉 十 2 1 三 二 (gui 一 gl1) (2.12) 


The latter formula gives a Symmetric two-step method for arbitrary even functions 
Wi with Wi(0) = 1, which do not necessarily satisfy (2.9). 
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Multi-Force Methods. The methods of Sect. XII.1.6 belong to the class of multi- 
force methods, which generalize the right-hand Side of (2.0) to a linear combination 
of Such terms: 


达 
Za 一 2cos(hD)zmnTTzn il= 岂 》 古 9(Gjzn) (2.13) 


JJ 一 1 


with 太一 (OO 5) = 力 (2) ,where Wi, 0 are even functions with 


>》 屿 0)=1， 册 0)=1 for7y = 了 天 


In our numerical experiments we include the method 


(五 ) two-force method (1.23) with (1.24). 


XIIU.2.3 Accuracy Comparisons 


The accuracy of the methods (A)-(E) and the Stormer-Yerlet method on a short 
time interval js Shown in Fig. 2.2, where the errors att 王 1ofthe different solution 
components in the FPU problem (with w = 50) are plotted as a function of the 
step Size 几 . Here and in all the following numerical experiments, the methods were 
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Fig. 2.2. Global error att 三 1 forthe different components and for the five methods (A) - (了 B) 
and the Stormer-Verlet method as a function of the step Size 几 
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Fig. 2.3. Global error at the first grid point after 上 = 1 for the different components as a 
function of the step Size 几 . The error for method (A) is drawn in black, for method (B) in 
dark grey, and for method (C) in light grey. The vertical lines indicate Step Sizes for which 
Auw equals T, 27, Or 3T 


implemented in the one-step formulation (2.7)-(2.8) with (2.9). The errors in the 
Z0-Components are nearly identical for all the methods in the stability range of the 
Stormer-Verlet method (pw < 2). Differences between the methods are however 
Visible for larger step Sizes. For the other solution components Z1，Z0，Z1 there 
are pronounced differences in the error behaviour of the methods. All fve methods 
(A)-(G) are considerably more accurate than the Stormer-Verlet method. Figure 2.3 
Shows the errors of methods (A)-(C) for Step Sizes beyond the Stability Tange of 
the Stormer-Verlet method. Methods (A) and (B) lose accuracy when jw is near 
integral multiples of Tr, a phenomenon that does not occur with method (C). 


XIIH.2.4 Energy 了 xchange between Stiff Components 


Figure 2.4 showsgs the energy exchange of the Six methods (A)-(E) applied to the 
Fermi-Pasta-Ulam problem with the Same data as in Fig.2.1. The fgures Show 
again the oscillatory energies 7, 172, 13 ofthe stiff Springs,theirsum7 三 三 十 72 十 73 
and the total energy 妃 一 0.8 as functions of time on the interval 0 科 二 冬 200. Only 
the methods (B), (D) and G) give a good approximation of the energy exchange 
between the stiff Springs. It will turn out in 9ect. XIIU.4.2 that a necesSary condition 
for a correct approximation of the energy exchange is 风 (Ppw)g(Ppw) = sinc(P)， 
which is satisfied for method (B). The two-force method (FE) Satisfies an analogous 
condition for multi-force methods. The good behaviour of method (D) comes from 
the fact that here VW(Pphw)g(Ppw) 之 0.95 sinc(Ppwo) for Po = 1.5. 
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Fig. 2.4. Energy exchange between stifft Springs for methods (A)-(F) (六 = 0.03, w = 50) 
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Fig. 2.$S. Maximum error of the total energy on the interval [0, 1000] for methods (A) - (F) as 
afunction of jw (step Size 中 = 0.02) 
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XIIU.2.$S Near-Conservation of TIotal and Oscillatory Energy 


Figure 2.3 Shows the maximum error of the total energy 囊 as afunction ofthe Scaled 
frequency jw (step size 几 = 0.02). We consider the long time interval [0, 1000]. The 
pictures for the different methods Show that in general the total energy ls well con- 
Served. Exceptions are near integral multiples of 交 .Certain methods Show a bad 
energy conservation close to odd multiples of T, other methods close to even multi- 
ples of rr. Only method (E) Shows a uniformly good behaviour for all frequencies. In 
Fig.2.6 we Show in more detail what happens close to Such integral multiples of 7r. 
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Fig. 2.6. Zoom (close to T or 27) of the maximum error of the total energy on the interval 
[0, 1000] for three methods as a function of jw (step size 玉 = 0.02) 
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Fig. 2.7. Maximum deviation of the oscillatory energy on the interval [0, 1000] for methods 
(A) - TD) as afunction of Anw (step Size 玉 三 0.02) 


Ithere is a difficulty close to T, it ls typically in an entire nelghbourhood. Close to 
27, the picture ls different. Method (C) has good energy conservation for values of 
Aw that are very close to 27, but there are Small intervals to the left and to the right， 
Where the error in the total energy is large. Unlike the other methods shown, method 
(B) has poor energy conservation in rather large intervals around even multiples of 
T. Methods (A) and (D) conserve the total energy particularly well for Po away 
from integral multiples of T 克 . 

Figure 2.7 Shows Similar pictures where the total energy 丈 ls replaced by the 
oscillatory energy 7 (cf. Sect.XIII.2.1). For the exact solution we have 7T(b) = 
Const 十 O(w-D.Itis therefore not surprising that this quantity is not well con- 
Served for Small values of w. For larger values of w，we observe that the methods 
have difficulties in conserving the oscillatory energy when hnw is near integral mul- 
tiples of T. None of the considered methods conserves both quantities 妃 and 了 
uniformly for all values of Acw. 
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XIIU.3 Principal Terms of the Modulated Fourier 
了 上 xpansion 


The analytical tool for understanding the above numerical phenomena is provided 
by 71odxlaled Foxrier expa11sio19,， Which decompose both the exact and the numer- 
ical solution into a Slowly varying part and into oscillatory components built up 
of trigonometric functions multiplied with Slowly varying coeffticient functions. A 
comparison of these expansions will serve as a partial Substitute for the backward 
error analysis of Chap.IX, which yields results only for jw 一 0 and is not applica- 
ble to the situation of pw > c > 0 that is of interest here. In this Section we derive 
the first terms of the modulated Fourier expansion. 


XIIH.3.1 Decomposition of the Exact Solution 


Every solution of the linear equation 世上 二 227 = 9(b with 02 of (2.2) can be written 
as zy 介 十 cos(wbu 抽 十 sin(wobv 蚊 二 O(w->) (forw 一 co), where yb， vb)， 
u 人 (bare truncated asymptotic expansions in powers of w-1 (see Exercise 4). These 
functions have the property that all their derivatives are bounded independently of 
the parameter w 六 1. Here and in the following, a soo 妨 function is understood 
to be a function with this property. We may hope to find a Similar decomposition 
for solutions of the nonlinear problem (2.1). So we look for a smooth real-valued 
function y( 轨 and a smooth complex-valued function z( 如 一 忆 区 十 九 ( 区 Such that 
the function 





2 (四 一 V0 十 eetz( 四 十 eetz 人 (人 (3.1) 


gives a Small defect when it is inserted into the differential equation (2.1) and has 
the given initial values 


Under the condition (2.3) the exact solution zZ(t has bounded energy，and we 
may expect the same of the approximation zx (如 ,which would then imply z( 塘 三 
O(w-)， We therefore insert the ansatz (3.1) into the differential equation (2.1) 
and expand the nonlinearity around the smooth part Vy( 力 . With the variables V 三 
(yo,W),z = (20,21) partitioned according to the blocks of 2, this gives the ex- 
pressions 


ee 注 上 as 
六 20 iot 1 一 20 十 23w20 十 2 加 0 
人 6 1 ( 2iw 坟 十 为 】 





一 oot 一 w2Z0 Se 27wZ0 十 Z0 
CE 全 
一 22wZ1 十 231 


and, as long as z( = O(w-D)， 


g(zs) = gg 十 gg(2 司 十 etg (gz 十 eg(y)Z 
十 ee 人 gg)(2 2 十 eg)( 有 下 十 O(wo) . 
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上 quations for the Coefficient Functions， We now compare the coefficients of 
lex%t ee and require that the dominant terms in these expressions be equal: 


加 三 ga) 十 90(O)(z, 习 
w201 = 多 ( 幼 
一 wa20 一 的 (g)z 
2o2 = 9g1(0)2 . 
This gives a System of differential equations for yo, 21 and expresses V1, 20 as 
functions of yo,21. We note that Vo evolves on the time Scale 1，whereas 2 
changes on the slow time scale w. As long as yo( 罗 stays in a bounded domain 
and 2 ( 力 = O(w-), (3.3) implies the bounds 
帮办 =Oo )，20 鸭 =O )， 广 的 =Ow 一) G.4) 
JInitial Values. The initial values yo(0),Vo(0) and 21(0) are obtained from condition 
(3.2), which gives a System that can be Solved by fxed point iteration to yield 
yo(0) =zoo+O(w 5)， 2Vo(0) 一 2oo 十 O(w”) 
2Re 21(0) 二 2Z0,1 十 O(w-2 )， 一 w2Im 21(0) 一 Z0,1 | O(w-2) 


(3.3) 


(3.5) 


Defect. As long as z( = O(w-D,the above equations show that the defect 
db 三 轨 十 27 一 g(zs() 
is of the form 


三 和 WwT2etxwta(t) 十 wT2e2etb( 办 十 O(w-3) 
| Oo ) 


with Smooth functions Qa,p. Together with (3.3) this also shows that the smooth 
O(w- -term % (四 (> 下 is the principal term describing the influence of oscilla- 
tory Solution components on the evolution of smooth components. 


(3.0) 


了 Example. To illustrate the approximation of the solution zt by zx(t of (3.1), we 
have Solved numerically, with high accuracy, the System (3.3) for the FPU problem 
with the data of Sect.I.3.1. In Figure 3.1 we plot the oscillatory energy 了 三 局 十 
12 十 71a with Z replaced by the approximation Z* in the definition (2.3) of these 
quantities. The fgure agrees rather well with Figure I.5.2. 
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Fig. 3.1. Same experiment as in Fig.I.3.2 for the solution (3.1) of (3.3) 
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XIIU.3.2 Decomposition of the Numerical Solution 


For the numerical method (2.6), which solves linear equations 艺 = 一 27 exactly， 
we look similarly to the above for a function of the form 
2 一 信介 十 ee 人 十 et 人 (3.7) 


With coefficient functions wn ( 姓 , 入 ( 切 which are smooth in the sense that all their 
derivatives are bounded independently of 尺 and w, such that zj (t gives a small 
defect when inserted into the difference Scheme (2.6) and has the correct starting 
values: 





21jn(0) 一 Z0， 2Zn(P) 一 21. (3.8) 
Taylor expansion of 次 (人士 尹 ) at the point 上 shows, after some calculation， 
加 08V1o 介 
矶 (za(t+ 月 -2cos(kDJza(D + 一 月 ) = [Ce 0 


iu 省 十 a221wo2.0( 四 十 cos(hw)3o( 十 】 (3.9) 


O2 27w0 轻 (区 十 cos(Pw) 祁 工作 下 





十 1 加 e co11aPplex co11j1U8aie of 1pe exp1essio1m 胡 加 e DJeyviouS Le， 


Where 故人 = (po( 人 1) and 软 伯 = (20( 罗 271( 坟 ) according to the 
partitioning in (2.2)， 


各 和 的 = 访 ( 和 人 月 一 2 的 十 四 一 六 ) 


is the Symmetric Second-order difference quotient, ok 三 sinc( 寺 juo)， and the dots 
stand for higher powers of 疡 multiplied by derivatives of 27.0 or 2,1. Taylor expan- 
Sion of the nonlinearity now gives 


耿 g(Gzp) 三 亚 g(@p) 十 歼 9 (yp )( 瑟 zz 瑟 37) 
十 etwet8g(Goj)21 十 etT70(Go)G2 十 


Modified Equations for the Numerical Coefficient Functions. For the moment 
We conslder the case where the absolute values of cl and cs2 are bounded from 
below by a positive constant，So that Apw is assumed bounded and bounded away 
from a non-zero integral multiple of T. We also assume APw to be bounded away 
from zero，which ls the computational Situation of interest. In this case, the first 
term in each line of each bracket in (3.9) can be conslidered as the dominant one. We 
therefore redquire that the functions yn 2zm Satisfy 


24010 一 9g0( 有 r) 十 外 ( 玉 )( 瑟 zz 2) 


(3.10) 


sinc2( 革 po)w2oj1 = Up) gi(Bon) (3.11) 
O O 
一 Sinc2( 寺 Po) ww2z10 三 宇 组 (@yz)27.0 十 人 (En)0CPw)271 
Ozo0 DOzl1 


(2yj)27.0 十 1 宝 5 


sinc(Pw) 2 斩 1 三 劝 (Po) 
尼 1 


601 
Ozo0 
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The first equation Should be stated more precisely as Vj0 being a solution of a 
modified equation for the Stormer-Verlet method (See Exercise IX.3) applied to the 
Corresponding differential equation: 


交 
加 ,0 二 (4 去 三 和) (go(Bo) 未 gf(Bo)(Bz82)) ; 
Where the time derivatives of yl1, 2 that Tesult from applying the chain rule are 
replaced by using the expressions in (3.11). As long as Vn .0( 力 remains in abounded 
domain and zj.1(b = O(w-), we have again bounds of the same type as for the 
coefficients of the exact Solution: 


011 人 办 三 Oo ，2a20 由 =Olo 3) ， 斩 =Ow (3.12) 


JInitial Values， We next determine the initial values yn ,0(0), Wo(0) and zx.1(0) 
Such that zj(0) and zz( 户 ) coincide with the starting values zZo = Z(0) and zl of 
the numerical method. We let zl be computed from Zo and Zo via the formula (2.7) 
with 7 一 0, and we sttl assume that ol and oo are bounded away from zero. Using 
(3.11), the condition zn(0) = Zo 三 (20.0,Z0.1) then becomes 


Z0.0 二 Wi0(0) 十 O(w-2?zn1(0)) 


(3.13) 
Z01 = 211(0) 十 有 1(0) 十 O(w-) . 





The formula for the first component of (2.7), Z1.0 一 20.0 王 几 Zo.0 十 二 1290(@z0)， to- 
gether with zj.o( 岂 一 Zno(0) = Po(0) 十 31290(@z0) 二 O(13)+O(w-221(0)) 
implies that 

200 = 加 0o(0) 十 O(2) +TO(w 一 21(0))， (3.14) 


For the second component we have from (2.7) 
211 一 cos(ho)zol 三 户 sinc(hwo)2zol 十 二 12Wp(pw) 9i(zo)， 
and by Iaylor expansion and (3.11)， 


Zhu(P) 一 cos(Phwo)zhni(0) 三 (1 =- cos(po))Wz1(0) 十 OU 一 2 
十 isin(ju)( zj,1(0) 一 31(0)) 十 O(Ppuo 一 2 1(0))， 


Where we note the relation (1 一 cos(huw)) yi(0) = 趣 P2W%(pw) 9i(@5yn(0)) by 
(3.11) and a trigonometric identity. After division by Psinc pw = w-1sin po the 
above formulas yield 


201 = 和 (221(0) 一 到 10)) +O(w- 二 Oo zi1(0)). (3.1S) 


The four equations (3.13), (3.14), (3.15) constitute a nonlinear System for the four 
quantities yo(0), V0(0)， w(27.1(0) 十 31(0))， and w(27.1(0) 一 31(0)). By fxed- 
point iteration and using the bounded-energy asSumption (2.3)，we get a locally 
unique solution for sufficiently small 凡 , with z,1(0) = O(w-1) and hence 


490 XII. Oscillatory Differential Equations with Constant High Frequencies 


Vi0(0) = Z00 二 OU ) ， no(0) =zoo+O(O) 
2Rezahi(0) =zol+Ow-)，-w2Imzni(0) =2zol+O(oD . 
(3.10) 
Defect. As long as za1 人 = O(w-T,the defect 
工 
沙 的 = 方 (zatt 十 骨 二 2cos(ADJzn 十 三 一 及) 一 7g(Ezh() (G3.17) 


is of SiZze O(12) by (3.9)-(3.10) and the very construction (3.11) of the coe 便 - 
cient functions. This estimate refers again to the non-resonant case Where al, a2 
are bounded away from zero and hence jw is bounded away from non-zero integral 
multiples of rr. The case of Aw near a multiple of r requires a Special treatment and 
will be considered in the next Subsection. 


XIIU.4 Accuracy and Slow 了 xchange 


A comparison of the Principal terms of the modulated Fourier expansions of the 
numerical and the exact Solution gives much inslight into the behaviour of the nu- 
merical method and the role of the filter functions WwW and 0. From this comparison 
we obtain error bounds over finite time intervals, and we discuss the Slow energy 
exchange between oscillatory components and the Slow energy transfer from oscil- 
latory to Smooth components which take place on the time Scale ww. 


XIII.4.1 Convergence Properties on Bounded Time Intervals 


As afirst application of the modulated Fourier expansion we consider error bounds 
on bounded time intervals. Second-order convergence estimates for more general 
equations 亏 = -47 + 9g(z) with symmetric positive semi-definite matrix 4, uni- 
formly in the (arbitrarily large) eigenfrequencies of 4, are given by Garcfa-Archilla， 
SanZz-9Serna 儿 Skeel (1999) for the mollified impulse method, by Hochbruck 多 Lu- 
bich (1999a) for Gautschi-type methods, and by Grimm 性 Hochbruck (2005) for 
general methods of the class (2.7)-(2.8) with appropriate filter functions. Those 
Tesults were proved with different techniques. The following bounds on the filter 
functions 录 and w are needed for second-order error bounds of method (2.0): 


lp(Pw)| < Cn sinc2( 寺 pu) ， 
1%(pw)| < Ca |sinc( 寺 pw)| ， (4.1) 
(pe)g(hajl < Cslsine(n)| 
Theorem 4.1，Cozsider tpPe 11azericalsolutiozz oftpe yyste11 02.1) 一 (2.3) py 11etpod 
(2.6) Wi 于 astep Size 下 芝 Po (wii GaSUcientmy sa1 Po idependeit of ww) Jor 


WAicpn Po > co > 0.ZLettipe startiag valUe Z1 pe give1 Dy (2.7) wii 态 风 三 0. 太 ipe 
CoO1di1io118 (4.7) da1e saisjied 加 e1a tjpe eror 1 pot1ided py 
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lz 一 z(npl< CCP Jor 六 生 了 . 


和 矿 om1y 1W(po)j| 和 Co lsinc( 寺 po)| polds instead of (4.1) tpen the ordler or com- 
ye1S8e1tce Jeduces 加 o1e: | Zn 一 2 有 Cjor7 玉 近 耻 刺 po 太 casey CT 
idepe1tdet of ww, 岂 a107W 了 1 玉芝 了 ad orpounads ofsolu1iomn derivafives Dot 
depe1ds o1 7 o1 下 of(2.3) om Dot1lds of derrvatives of tpe 11o1Uiearity 9 G11d o1 
C1， C2， C3 O 矿 C0. 


To obtain second-order error bounds uniformly in Au, condition (4.1) requires 
a double zero of mW and a zero of 0 at even multiples of T，and a zero of 亿 
or 0 at odd multiples of T. This is satisfied for the molliftied impulse method 
with %(c) = sinc(), for which WwW(6) = sinc2(6). Gautschi-type methods have 
(6) = sinc2( 革 6), so that the first condition on win (4.D is trivially satisfied. The 
conditions on % hold, for example, for % = Sinc or for 0 of (1.20). The original 
Gautschi method has % = 1, which does not satisfy the second condition of (4.1)， 
and the Deufhard/impulse method (% = sinc, 轴 三 1) satisfies only the third condi- 
tion of (4.1). These latter methods are not of second order uniformly in Pv. 


Poofof71eorem 和 1 (a) First we consider the case where Phw is bounded away 
from integral multiples of rr, So that condition (4.1) is not needed. Comparing the 
equations (3.3) and (3.11), which determine the modulated Fourier expansion coef- 
ficients, Shows 


帮办 一 办 =O( 人 ) ， 因 直 一 2 的 =O(A) 
on bounded intervals, and hence 
Zi 的 一 zs 人 的 =O() ， (4.2) 


The variation-of-constants formula (1.8) and a Gronwall-type inequality Show that， 
on bounded intervals, the error zx( 臣 一 Z(bisofthe same magnitude as the defect: 
by (3.6)， 

2 人 四 一 ZN 一 O(w-2) . 


The errors en 一 Zn 一 21 人 (如 ) Satisfy 
en+H1 一 2cos(hO) en 十 en_1 三 轴 (4.3) 


with bn = 12( 丈 9g(@Bzn) 一 严 9g(Ezp( 如 )) 一 加 (如 )). This recurrence relation can be 
Solved to yield (Exercise 9) 


em 十 1 一 一 克 ，1e0 十 他?Pel 十 >》， 人 7 区 (4.4) 
和 1 
人 0 
作 三 0 0 十 1)Pcw 有 有 
Sin Acww 
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A discrete Gronwall inequality now yields that on bounded intervals, en is of the 
same magnitude as the defect dj (为 of (3.17), which is O(j12) by the construction 
of (3.1D) and by zu,1 = O(w-. Hence， 


zi) = OO . 


Combining these estimates yields the desired second-order error bound. 

(b) We now consider the case where w|sinc( ;ju)|  c with a sufficiently large 
constant c, which depends only on bounds of derivatives of 9. This condition means 
that Ahw is outside of an O( 几 ) neighbourhood of integral multiples of 2r. Under 
conditions (4.1), the equations (3.11) still give 


内 1 提 = 一 Oo ， 因 0 由 =Oo， 斩 1 鸭 王 Oo (4.5) 


as long as zj = O(w-D. Here the first condition of (4.1) gives the bound of 
jn,1l, the Second one the bound of zj,0,， and the third one the bound of 入 ,1. As 
in Sect.XIII.3.2, we determine the initial values yn ,0(0), yz.o(0) and zn,1(0) such 
that zhn(0) and Zn( 户 ) coincide with the starting values zo and zl of the numerical 
method. Using once more (4.1), we obtain a System for the iniltial values Similar to 
(3.13)-(3.15): 


200 = 加 o(0) 十 O(w-1zn1(0)) 

Z01 = 坟 1(0) 十 柬 1(0) 二 O(w-2) 

200 = Zno(0)+O(J 二 O(w-Izj1(0)) 

201 = 各 (zi(0) 一 到 1(0)+O-D 上 +O(zai(0)). 


With the weaker estimates for zu.o( 切 and in (4.6) we still obtain estimates for the 
initial values ofthe type (3.16) with at most one factorw-1l orPlessin theremainder 
terms. Condition (2.3) implies again 2(0) = O(w-D，which ensures that (4.5) 
holds for 0 区 二 芝 了 . The defect is then 人 = O(12), and as in part (a) we get 
the Second-order error bound. 

(c) Now let wlsinc( 计 pw)| < c, so that huw is O(P) close to a multiple of 27. Im 
this case we replace the third equation in (3.11) Simply by 





(4.0) 


《jh,0 一 0. 


Under condition (4.1) we Still obtain the bounds (4.5). The initial values are now 
chosen to Satisfy 


忆 0;0 一 yj0(0) 


几 
2Z01 二 城 这 四 二 天 汪 0 二 让 全 全 二 中 作 0 
SIDC (pw) (4.7) 
2Z0o0 三 人 0(0) 
201 = 记 (zj1(0) 一 有 100)). 


They are then bounded as in (b) and, by the arguments used in the determination 
of the initial values of Sect. XIIIL.3.2, yield the estimates zhn(0) = zo 十 O(]3) 
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and zhn(7j) = 21 十 O(1)， and again zl = O(w-). Since (4.D implies 
gpho)zj1i = O(w-2) in the present situation of |sinc( 寺 po)| < cw-L the de- 
fect is still dr 念 = O(A2). The bound (4.2) is also seen to hold. Therefore the 
Second-order error bound remains valid in this case， 

(gd) If only |%(pu)| < |sinc( 寺 po)| holds, then we replace the third equation in 
(3.11) by za0 = 0. 正 w|sinc( 去 pw)| 所 1, we also set W,1 三 0. The defect is then 
only d ( 轨 三 O( 由 ,which yields the firstorder error bound. 














For the velocity approximation，we obtain the following for the method (2.12) 
or its equivalent formulations. 


Theorem 4.2， Zrder 1Pe co1lditio1ls of 7T1eore11a 纪 .1 co1lsider 加 e velocify GDPProxi- 
Ja1io1m SCcje1ae (2.72) mW 训 ainction Wi satishing Wi(0) = 1 amd 


la(pe)| < CY lsinc( 寺 po)| (4.8) 


Let 1jpe slartizg Values satish 2 = ii(0) amd 2 = 2(P) 十 ]sin(P2)al 二 O(2) 
Wiij al =O(). 7Tjem 加 e eror 六 妨 evelocifies 1 Dotaded py 


2 一 ZE CR or 7 玉芝 工 ， 


WhereCiipdependentorw andmwi 太 7 玉 芭 人 and ofboupnads ofso1ution deriva- 
1ivey DU1 depe11ds oo 玉 of(2.3) o11 Dou1ds ofderimatives ofipe 11o11Uiearity 9， 
a1d on C1 Co, C3 a11d C1. 


Proof (a) By the variation-of-constants formula (1.8), the exact Solution Satisfies 
守候 十 玫 ) 一 2cos(POO)Z 十 2 一 万 ) 
一 三 cos(( 刀 一 5)0) (glztt 十 5) 一 g(z 一 5)】 ds . 
With the modulated Fourier expansion, we write the exact Solution as 
2 明王 1 昌 二 eetz 人 的 十 ee 他 才 二 Oo) 
to obtain 


g(zlt 二 5) 一 g(zt 一 5)) 
= 9 (yb) (2s 8 一 4sin(ws) Im (ez 人 人) 十 O(s2) 十 0 


Using the bounds (3.4), abbreviating 9ij = 99i/0zj and omitting the arguments 上 
and y(t) on the right-hand Side, we therefore have 


宛 人 十 几 ) 一 2cos( 几 IO) 十 2 一 万) 


( 刀 2 90;0 20 2 2 帮 ? sinc2( 去 jw) W 90,1 Im (eetzl) 浊 O(13) 】 
j2 sinc2( 革 pw) gl0go 2172 sinc(hw)wgiiIm(eetza) 二 OO) 
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Wenowusethe discrete variation-of-constants formula (4.4) and partial summation. 
For example, the expresSlion 


人 


sin(7 十 1 工 一 刀 jw 区 
泡 人 J) 芭 2sinc2(17eo) go(y(7 和 ) 加 (7 


Sin /ww 





和 1 


is seen to be O(P) uniformly in jw and for "下 和 全 by partial summation, using 
that the function gil.o(V( 轨 )7o( 区 has abounded derivative and that 


和 (7Tpo) = O( 人 
二 ， w) 一 
In this way we obtain 
ZnR) 三 一 内 -12(0) 十 TmZ (4.9) 


平 人 1 一 0 | 十 O(J . 


(b) For the numerical approximation we proceed similarly. Inserting the modulated 
FEourier expansion of the numerical solution， 


Zr 一 大人 胃 十 et 鸭 (有 十 ee 办 二 Ofor t=7 力 < 工 ， 
into the numerical scheme, we have with (3.12) or (4.5) 
Znp+l 一 2cos(Po)2zn 十 Zn 1 


一 72 ( 9g000.0 一 20%(Ppo) sinc(hwo)wgolIm(etetznl) 二 O( 人 ) 
Wi(Pho)giogo 一 2(000 的 (ho) sinc(po)wgiiIm(eetzrD) 十 O() 


Where the functions 9ij are evaluated at G 沪 (人 坟 and the argument 三 77 is to be 
inserted in yz.o and 217.1. Under the condition (4.8) on W1, we obtain as in (4.9) 


学 计 一 一 和 风 一 1 2Z0 十 全 221 (4.10) 
涉 人 二 58goo(8wn 人 5) 力 ,0o(7 人 】 OU， 
Since we know from the estimates (3.12) and from the proof of Theorem 4.1 that 


Go 从 =V 才 二 O(2) and 太 雪 二 7 十 O(),acomparison of (4.9) and (4.10) 
gives the result. 














XIIU.4.2 Intra-Oscillatory and Oscillatory-Smooth Exchanges 


In this Subsection we turn to the approximation of Slow effects that take place on 
the time Scale w. Since Solutions may depart from each other exponentially，we 
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cannot expect to obtain Small point-wise error bounds on Such a time Scale. Instead， 
we take recourse to a kind of formal backward error analysis where we Tequire 
that the equations determining the modulated Fourier expanslion coeffticients for the 
numerical method be small perturbations of those for the exact solution. It may be 
expected that methods with this property - ceteripus pa1i -Show abetter long-time 
behaviour and this is indeed confirmed by the numerical experiments. 

In the Fermi-Pasta-Ulam model, the oscillatory energy of the 7th stift Spring is 


万 汉 起 21 十 地 wo2z1 
Where Z1j ls the 7Jth component of the lower block zl of Z. In terms of the modu- 
lated Fourier expansion, this is approximately, up to O(w- 浊 )， 

万 下 iozijye 一 ie 各 引 十 二 w2|27e 十 到 et 一 2w2|2 7 . 
The energy exchange between Stiff Springs as Shown in Fig. 2.1 is thus caused by 
the Slow evolution of 21 determined by (3.3). This Should be modeled correctly by 
the numerical method. 

The term gf (V)(z, 2 in the differential equation for yo in (3.3) is the dominant 
term by which the oscillations of the stiff Springs exert an infuence on the Smooth 
motion. A correct incorporation of this term in the numerical method is desirable. 

Upon eliminating V1 and 2zo in (3.3), the differential equations for yo and 2 
become, up to O(w-3?) perturbations on the right-hand sides， 


避 D? _ 
加 = go(yow-2g1(yo,0)) 十 5 (yo,0)(21, 到 1) 二 人 靖 
27w 3 一 <2(00,0) 阅 
DOzl 


This ls to be compared with the analogous equations for the modulated Fourier 
expansion of the numerical method, which follow from (3.11): 


? 
08510 = go( 力 0o7w-291(o;0)) 十 可 号 (n， 0)(27.1; 37.1) 
1 


(4.12) 
22w 坟 1 一 ano0) 2j) 江 
屎 1 
CoCo) Cogj(u) 
加 W 本 2 Us 
二 sinc(Ppo) ” 人 sinc2( 二 po) Da 


The differential equation for zj,1 is consistent with that for 21 only 让 a 三 1,i.e.， 
WU(hw) opwo) = sinc(Po) . (4.14) 


Among all the methods (2.6) conslidered, only the Deuflhard/impulse method (w 三 
sinc, % = 1) satisfies this condition. For this method we indeed observe a qual- 
itatively correct approximation of the energy exchange between Stiff Springs ip 
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Fig. 2.4, but we have also Seen that the energy conservation of this method is very 
Sensitive to near-TresonancesS. 

A correct modeling of the Slow oscillatory-smooth transfer would in addition 
redquire 和 = 1 and possibly 7 = 1. For general jw the condition 7y = 1 is, however， 
incompatible with (4.14). 

Multi-force methods (2.13) offer a way out of these difficulties. For such meth- 
ods, the coefficients of the modulated Fourier expansion Satisfy (4.12) with (4.13) 
Teplaced by 





> Vj(w) 07(puw) 受 
sinc(Aw) 中 党 2 光 ( gr) ， 
7 = 2 (0) 屿 (Jo) 2 . (4.15) 


The two-force method (1.23) with (1.2S) has w = 和 = 了 =1asdesired. 


XIIHL.S Modulated Fourier 了 xpansions 


The decomposition ofthe exact and the numerical solution into modulated exponen- 
tials and aremainder, as derived in Sect.XIIHJ.3, was found useful for understanding 
Several important aspects of the numerical behaviour. Those few terms are, how- 
ever, not Sufficient for explaining the long-time near-conservation of the total and 
the oscillatory energy. The expansion can be made more accurate by adding further 
terms e 士 20t，e 士 3iwt etc. multiplied by slowly varying functions. This leads to an 
asymptotic expansion which we call the 1zodxwiated Foxrier expa11sio1. This expan- 
Sion ls constructed in the present Section, following Hairer 儿 Lubich (2000a). (In 
that paper the modulated Fourier expansion was called the frequency expansion.) 














XIIU.S.1 Expansion of the Exact Solution 


The following theorem extends the construction of Sect.XII.3.1 to arbitrary order 
| 
inw- 


Theorem S.1， Comsider asolutionm z(t orf2.7) wjicp satisjies 1pe poxaded-enerey 
CO1di1io1 (2.3) G1d slayS 1 G co1Dacf Sef 有 Jor0 挟 寺 挟 了 .7T1em tje SolUtio1 
Ga11111S G11 eXDG11S1O1L 


zZ 人 (一 8 的 二 >》 ee 办 十 尽 w 人 (5.1 
0<|K|< 


Jorarpitary N 之 2 WwWjere tpe 7e11aai1zder 1erma ad 1 derivatiye de Dou1nded Dy 


RN 人 =OwrN- 2 and RN 和 ==Owrx DJjor 0<t<T， (2) 
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7T7Je 7eal-valzed jnctions 9 = (yo,W1) and te comaplex-yalued jnctionsy 次 一 


(2 冰 ) 1ogelper wiii all tpeir deripatives ( 友 加 arbitrary order M) are boxnaded 
D 


人 
V1 一 Oo 一)， 好 和 O(w-))， 
Jpr[ = 2...,N 一 1 Moreover z 一 2 Jor al FE 7T1ese jpnctio1ay da1e Vigue 


1UD 1o je11ag of size O(w-N-2). 7T7je comstanmts sy1polized py 态 e O-motafiom a1e 


112depe1de1t orw adtwii0 挟 三 7(DUttzey depend on NT on ror2.3) om 
Dov1zds oftjpe derivatives of tpe nolinearipy g(Z) on 开 ，a1zd o7 加 e JIGXI7UU1NL Order 
Ad of comsidered deriyatiyveS). 


Pioof We Set 


zs 人 (一 3 的 十 >》 eeeta( (G5.4) 
0<|KE|<N 
and determine the smooth functions y( 臣 ，z 介 一 2 and 22( 胃 21 


Such that zx(t) inserted into the differential equation (2.1) has a small defect, of size 
O(w-w).Tothis end weexpand g(z,()) around y(t) and compare the coefficients 
of etkwt. With the notation g(( 力 zc = go (人 (zol ,zam) foramulti-index 
Q 三 (al,...,am)j,there results the following system of differential equations: 


( 十 全 = 9( 


一 w220 22w20 村 20 1 (mp) Ca 
( 2iu 罗 】 二 ( 当 二 


1 
-一 gm)()z2 (5.5) 


IT 


加 





s(a) 王 1 
一 52w2z 21i 庆 十 2 动 
人 一 12Jo2 才 人 2kiwo 示 十 水 几 一 乞 ， 机 9” (的 2， 6G.7) 
Here the sums range over al 几 > 1 and all multi-indices a = (al, ,am) with 


integers oj Satisfying 0 < |aj| < N, which have a given sum s(a) = 2 产 1 0 
For large w, the dominating terms in these differential equations are given by the 
left-most expresslions. However, Since the central terms involve higher derivatives， 
we are confronted with singular perturpbation problems. We are interested in smooth 
functions y, z, z& that Satisfy the system up to a defect of size O(w- 六 ). In the spirit 
of Eulers derivation of the Euler-Maclaurin Summation formula (See e.g. Hairer 多 
Wanner 1997) we remove the disturbing higher derivatives by using iteratively the 


differentiated edquations ($.5$)-(9.7). This leads to a System 


0 岂 一 1 一 工 > ，, 一 1 人 工 一 1 一 | 
加 三 0000 2 ) 后 2 二 Jo ) ) 

一 ，) 一 2 了 1 六 一 1 一 工 _ ，, 一 2 工 JV 一 1 一 工 
Z0 二 Go0(000 25 ) CO 及 UL 二 w Go 2 ) ) 
让 富 w72G4(0o 20 二 0 2 二 w72G8(00， 2 人 
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Where .万 ,979 的 are formal series in powers of w-1. Since we get formal algebraic 
relations for Wi, z0, 2，we can further eliminate these variables in the functions 
Ch 9 We finally obtain for Wi, 21, 2 the algebraic relations 








20 = woT2(Goo(yo, 加 袜 ) 十 w 1Gol(yo, 加 0, 襟 ) 十 …) 

呈 = w72(Gio(lyogo2) 十 orIGii(yogo2) 十 …) 8) 
闪 = w72(G 和 (oo) 十 wrIG 人 (yo,go; 袜 ) 十) 
0 00\V0，V0， 2 01\Vo, Vo， 2 … 

你 = woT2(Gfo(yo oz) 二 wrGY(o go2) 十 …) 


and a System of real Second-order differential equations for yo and complex first- 
order differential equations for 21 : 


加 = Foo(lyoyoz) 十 wTTFol(yopo) 十 …. 


， . (5.9) 
电 = wTI(Rmo(yo,po2) 十 wmai(yooo2) 十 .….). 


At this point we can forget the above derivation and take it as a motivation for the 
ansatz (9.8)-(9.9), which ls truncated after the O(w- ) terms. We insert this ansatz 
andiits first and second derivatives into (3.3)-(3.7) and compare powers of w-1.This 
yields recurrence relations for the functions 环 帮 G 和 ， which in addition show that 
these functions together with their derivatives are all bounded on compact sets. 

We determine initial values for ($.9) such that the function zx(t of (5.4) satisfies 
2x(0) 一 Zo and zs(0) 一 zzo. Because of the special ansatz (5.8)-(S.9), this gives a 
System which, by fxed-point iteration, yields (locally) unique initial values yo(0)， 
2o(0), zi(0) satisfying (3.5). The assumption (2.3) implies that zi(0) = O(w-).I 
further follows from the boundedness of 灵 ; that 2 人 = O(w-D for0 区 夺 去 了. 
Going back to (9.7), it ls Seen that the functions @GA contain at least 5 times the 
factor 21. This implies the stated bounds for all other functions，. 

Itremains to estimate the error 尺 N 人 (一 2( 一 0x( 坟 .Forthis we consider the 
solution of (5.8)-(9.9) with the above initial values. By construction, these functions 
satisfy the system ($.$)-($.7) up to a defect of O(w-). This gives a defect of size 
O(w-w>) when the function zs(t) of (5.4) is inserted into (2.1). On a finite time 
interval 0 < < Tthis implies Rv( = O(w-N) and RN 人 = O(w-N). To 
obtain the slightly sharper bounds (3.2), we apply the above proof with N replaced 
by w+2andusethe bounds (3.3) for 2 and zm+1L. 














XIIH.S.2 Expansion of the Numerical Solution 


Does the numerical solution of (2.1) have a modulated Fourier expansion Similar 
to the analytical Solutiony This may of course be expected, but in Sect. XII.3.2 
we encountered difficulties in constructing the first terms of the expansion in the 
Situation of a numerical resonance where Ahw is close to an integral multiple of 7. 
We therefore confine the discussion to the non-resonant case. We asSsume that 尺 and 
wrl lie in a subregion of the (],w- 蕊 -plane of small parameters for which there 
exists a positive constant c Such that 
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|sin(3iKpo)| > cv 六， for 有 = N，with N >2. (5.10) 


This condition implies that pw is outside an O( V) nelighbourhood of integral mul- 
tiples of T. For given 几 and w, the condition imposes a restriction on NV. In the 
following, N is a fxed integer Such that (5.10) holds. There is the following numer- 
ical analogue of Theorem 3.1. 


Theorem S.2，Co1zsider tpe 1aiericalsolutiozz oftpe yyste11 (2.1 一 (2.3) py 11etpod 
(2.0) wii step size 及 Zeftpe starting valUe Z1 pe give py (2.7) Wi 碎 风 = 三 0.48su1Ie 
jw > co > 0, 1jpe 1o1-1eso1aa1tce co1a1dition (9.70) da1d 1pPe potnadsy (4.1) Jor mp 
010 gw). 7TPem 1pe 7111ae1ical solution Gd1ait8 Ga12 exXpa11Sio7z 


Zr 一 及 办 十 >》 eet 驮 ( 胃 十 尼 w 人 (5.11) 
0<|K|<N 


Zoo11ly jor0 芝 上 三 7 玉芝 了 .7T1e 7e1iaizaderiermais oripe 加 1771 
尽 RN 人 三 妇 1 有 r 的 mw 可 7 提 =O(OPo)> 二 jpm) ， (S.12) 


Where mm 之 0ca1n pe chose1 Ga1Di11a1z 7TPe coe 太 cient Jactio11sy 1ogetper withp al 
1jeir derivatives (UPD 10 so111e Q1Di1GFiy Jixed ordem are pox1nded py 


210 一 OUlD)， z0 一 O(w-)， 这 0 7 O(w-)， (5 13) 

gp =Ow 9)，a1=Ow )， 罗 1 一 OU ) 
Jor8 = 三 2),...,N 一 1 Moreoyver 0 三 冰 Jor QU KE. 7T71e co11s1a118 Sy11Dolized By 
1jpe O-1otafion d1e idepe1tdet ofw ad 凡 Yi 了 (3.70) put 1ey depe1d on 一 和 N， 
710， C G1G 了 . 


The proof covers the remainder of this Supsection. It constructs a function 


Zhi 昌 一 久 的 十 >》 epe 允 的 (5.14) 
0<|K|< 


with smooth coefficient functions u(t) and zx 次 (办 , which has a small defect when 
it ls inserted into the numerical Scheme (2.0). The following functional calculus is 
convenient for determining the coefficient functions. 


Functional Calculus. Let 矿 be an entire complex function bounded by |jG)| 芯 
C elcl. Then， 
NS (0) 
JADjzg = 半 央 z 人 全 


下 一 0 





converges for every function Z which is analytic in a disk of radius 7 > 7R around 世 . 
下 万 and 户 aretwosuch entire functions, then 


万 (D) 户 AD)zO = ( 矿 户 )D)zO 
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Whenever both sides exist We note (DJRz(b = hzto(b for 天 = 0,1,2, .and 
exp(PD)Jz 坟 王 Z 十 轧 . 

We next consider the application of such an operator to functions of the form 
etetz( 四 . By Leibniz” rule of calculus we have (]D)8eietz 人 办 一 etet(jD 十 
zjojsz(t. After a short calculation this yields 


JRD)eetz 提 = eeFRD 二 io)z( (S.15) 


where jpD +iho)z 人 = 并 已 OCRw)/ 让 xz 人 ， 
An N-times continuously differentiable function Z is replaced by its Taylor 
polynomial of degree N 一 1 at 纪 and F(PD)Jz(b is then considered up to O(PpN). 


Modified Equations for the Coefficient Functions. The difference operator of the 
numerical method becomes in thlis notation 


zt 十 四 一 2coshDz 四 二 zt 一 站 =(e 吧 一 2cosnhP 二 ez 
We factorize this operator as 


CD) := e2 一 2cospn2+er2 = 2(cos(iD) 一 cospnD) 


(3.10) 
= 4sin(3AO2 二 3AD)sin(mD 一 二 7D) . 
The function zn( of (5.14) should formally (up to O(JA +2)) satisfy the difference 
Scheme 
ChD)zn(b = 12og(Gzpn()， (5.17) 


We insert the ansatz (3.14), expand the right-hand side into a Taylor Series around 
G 凡 (and compare the coefficients of etkwt. This yields the following formal 
equations for the functions 妨 ( 轨 and z 人 (四 : 


工 
上 双 一 (mm) Ge 
CDJ)o = 72g(gB+ > 历 900(Bn)(Ba)o 
， 和 G.19) 
CRD+ijpo) 寺 = 72m >， go0(Go)( 瑟 zz)2. 
s(a) 一 有 
Here, a = (ail, ,am) is amulti-index as in the proof of Theorem 5.1, s(a) 三 
到 oa 六 and (有 z)2 is an abbreviation for the mm-tupel (Gz21,...,zom). To get 


smooth functions yj ( 轨 and 次 ( 旭 which solve (5.18) up to a small defecb we look 
at the dominating terms in the Taylor expansions of C(PD) and C(PD 十 大 po) 
With the abbreviations sk = sin( 寺 KEjhw) and cx 一 cos( 寺 fjw) we obtain 


(6 | 一 6 | ( 太 D) 十 .…: 


1.2 
二 庆 ( 3 0)+ae 人 0 1 ex 


CUPD) 
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， 452 0 1 0N ， 
C(AD 二 hu) = ( 0 | 十 252 和 ( 芭 门 ) 


1 0 ， 
一 Co2K 让 GD 


Construction of the Coefficient Functions. Under the non-resonance condition 
(3.10), the first non-vanishing coefhcients in (3.19) are the dominant ones, and the 
derivation of the defining relations for yj and 2 js the Same as for the analytical 
Solution in Theorem 3.1; See also part (b) of the proof of Theorem 4.1. We insert 
(3.19) into ($.18) and we eliminate recursively the higher derivatives. This motivates 
the following ansatz for the computation of the functions Vn and 2 


入 0 = Jo() 二 V Joi() 十 六 Joz() 十 .…. 
冀 1 三 CjoO+ vaO+， 


六 2 
。 = 总 ( 喇 二 VR 9 人 CO) 十， 
吉 


2Vj,1 一 人 (9.20) 


克 。 = 每 ( 鸣 O+ VE 咯 GD) 


天 
UP)j2 
状 一 人 (的 CT 人 十 小， 
SK 二 1SK 一 1 
for 8 = 2,...,N -1 where the functions depend smoothly on the variables 


思 0, oo gp) 熙 and on the bounded parameters V 太 /sj， sj， ch W(Po) and 
(Po)-1. Inserting this ansatz and its derivatives into (3.18) and comparing powers 
of v yields recurrence relations for the functions 用 ; 9 放 . The functions 9 (for 
及 > 1) contain at least times the factor %( Po) 大 、， and 万/ contains this factor at 


least once. Since the Series in (9.20) need not converge, we truncate them after the 
ANT+m+2 terms， 


JInitial Values. The conditions zz(0) = Zo and zz(P) = 21 determine the initial 
values 1.o(0), yj.o(0) and za,1(0) in the same way as in Sect. XIIIL.3.2. Condition 
(4.1) yields again (4.6), and (2.3) then implies 2.1(0) = O(w-D). 

Defect， It follows from (4.1) that pp(Ppo)d(pwo)/sa = O(w ,so that 才 1 = 
O(w 一 六 1) by (5.20). This implies 2 1 = O(w-D fort < 了 .The other esti- 
mates (5.13) are directly obtained from (3.20), which indeed yields the following 
more refined bounds for the coeffticient functions together with their derivatives: 
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Wo=O()，W=O(-) 
奈 0 运 O(w-3/V)， 大 1 一 O(w 一 )， 康 1 二 O(w-”) (5.21) 
50 一 OUgoJ wo) 二 Oo)ghw) wo )， 
Consequently, the values zjn(nA) inserted into the numerical scheme (2.6) yield a 
defect of size O(PNT+2): 

Zhi 人 十 尹 ) 一 2cos(RD) zt 十 Zn 一 尹 ) 一 


.22 
= 了 (9(Bzn(b) +O(b(hw)Nw-N 十 让 4 
Standard convergence estimates then Show that, on bounded time intervals，Zmn 一 
Znh(mP) is of size O( 妇 7 ) and actually satisfies the finer estimate (5.12). This com- 
pletes the proof of Theorem 5.2. 














XIIU.S.3 Expansion of the Velocity Approximation 


Asimilar expansion holds also forthe velocities. We Show this forthe Scheme (2.11) 
or its equivalent one-step formulation (2.8) with (2.9). 


Theorem S.3，V7mder 加 e asSU1ID1io1s Of 7T1eore1a 59.2, 1je velocify GaPProxiIatio1 
Zn 8iVe1 DYy (2.77) Has a11 exDa11S1O7 


家 二 >》 eet 上 有 十 OA 
0<|K|< 


Waiy jpr0 芝 二 =7 故 近 人 wpere tpe yeal-yvalued jzactionsy wu 三 (Un 0 UP TD) 


G1C 11e co11DUex-yVGLUed jj2ctio118 局 一 (LO 0， 1 1) 1ogetjper mi 态 加 eir derivativeys 


2UD 1o arpitjary order satisj 


oji0 三 纪 0 十 O()， 0 一 O(w-)，aoo= 王 Oo (5.23) 
1 io 专 1 +O- )，wp=O 下)， 8 一 Oo) 
Jor 有 = 三 2 ..N 一 1. Moreover 了 三 六 7je co1lsta11sy Sy1Dolized py 加 e 


O-7notation ae jdepe1de1t of an1d 凡 Wi 碎 (5.710) pu depend on 瑟 , N， c ad 工 . 
Pioof Let 凡人 (tbe defined by the continuous analogue of (2.11)， 
21.sinc( 尹 TO) Up 人 三 2Zp( 人 十 帮 一 2 人 一 万 ). (3.24) 
Theorem 5.2 then yields that 
2 二 人 (办 十 O(2pD) 


fort 一 mA on bounded time intervals. Here we used that the remainder term in the 
lower component of (5.12) is of the form O(V(Ppo)(O(pw) 十 站 妇 Pw), so that its 
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quotient with 2 刀 sinc(jo) becomes O( 妇 ji -1) by the third of the conditions (4.1) 
and by (5$.10). The function wp ( 妇 can be written as 


oa 人 的 十 提 二 >》 eeetu 人， (5.25) 
0<|K|< 


We insert the relation (S.14) into 一 ;sin(zAD)zZn 坟 三 天 sinc(P2)uz( 坟 ,which is 
equivalent to (5$.24), and compare the coefficients of eikwt to obtain 


Sinc( 人 ?万 ) 27.0 一 VR,0 


sinc( 包 万 ) 人 1 三 Sinc(jpwo)vn il 


(3.20) 
(1 sin(zD 一 大 po) 汰 二 2 0 
( 坟 )-1 sin(zjD 一 Kpo) 砍 1 = sinc(po)aop 
forg 三 1 .. ,AN 一 1Iparticular for 1 we get 
cos(hwo) .，， 
0 一 iu cos( 记 万 ) 2 一 训 0 sin(? 万 ) 2 二 (5.27) 


With the above equations, the estimates now follow with the bounds (3.21) of the 
coefficient functions and their derivatives, using again (4.1). 














XII.6 Almost-Invariants of the Modulated Fourier 
了 xpansions 


The System for the coefficients of the modulated Fourier expansion of the exact 
Solution is Shown to have two formal invariants, which are related to thetotal and the 
oScillatory energy. In particular, this explains the near-conservation ofthe oscillatory 
energy Over very long titmes. Analogous almost-invariants are Shown to exist also for 
the modulated Fourier expansion of the numerical solution. This forms the basis for 
Tesults on the long-time energy conservation of numerical methods, which will be 
given in Sections XII.7 and XII.8. 


XIIU.6.1 The Hamiltonian of the Modulated Fourier 上 xpansion 
The equation (2.J) is a Hamiltonian System with the Hamiltonian 


刀 (z,2) = 下 2 十 0227 十 U(z) . (6.1) 


In the modulated Fourier expansion of the solution z( 轨 of (2.1), denote y0( = 
(and (一 eteetz 人 (0< 下 < V),andlet 


站 二 (国人 1 人 ,0 .4 下) 相 
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By (S$.5)-(3.7) these functions Satisfy 


， 1 7 CQ 一 
苏 +22 闪 = 一 >》 二 Try +Ow yy)， (6.2) 
7 
5s(a) 一 有 
Here, the sum is over al mm > 0 and all multi-indices a 三 (al,. ,am) with 


integers oj (0 < |oj| < N) which have a given sum s(a) = 症 产 1 oj, and we 
Write y2 = (y21 ,yam). We define 


1 
WU) = (+ > 二 To)y” (6.3) 
7 
s(a)=0 
From the above it follows that y( 如 satisfies the system 
汰 十 22 几 = 一 VseU()TO(w-) (6.4) 
which, neglecting the O(w-w) term, is the Hamiltonian system (cf. Exercise 6) 





4 三 六 全 全 二 一 3) (0.5) 
TH(y .一 RNT 一 人 NT PD2vR 
y, 习 一 5 六 (G ) 六 二 (人 ) +2() (6.6) 
|R|<N 


Theorem 6.1.， Lader 1pe assu1ID1io11s of 7T1eoye111 3.7 1pe Famazlioia1z or tpe 1iod- 
ULated Foxrier eXpa1181O11 SG1is1ies 
MG yb) = MG(OD)YO)+O( ) (6.7) 
yy) = 瑟 (z(02D)+OUw ) (6.8) 


71e co1sta11s Sy1polized py @O ad1e idepende1t ofrw adtw 了 0 区 寺 近 了 Do 
depe11d o1 古 ，NV a1d 工 . 





| 


Proof Multiplying (6.4) with (7 一 P)7 and summing up gives 


> ， ( 们 9 六 +22g) =- 人 2 +O 一 )， 
||< NV 


Integrating from 0 to tf and using y 下 一 ZR then yields (0.7). 
By the bounds of Theorem 5$.1, wehave for0 芯 寺 所存 


(7, 本 三 让 加 上 二 直下 近 二 co3 玫 必 + 二 O(C- (6.10) 
On the other hand, we have from (6.1) and (3.1) 
豆 (z, 字 一 引 六 | 检 十 让 瑞士 开 上 十 直 二 上 VD)+O(w 06.1D 


Using 旭 = eietz and 仙 = eiet( 谋 十 记 杂 ) together with yT1 = 好 ,itfollows 
from 计 一 OU ) that 碳 十 人 一 和 (由 一 抽 ) 二 OU 9 and| 寻 | = wo 二 


O(w-I). Inserted into (6.10) and (6.11), this yields (6.8). 
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XIII.06.2 A Formal Invariant Close to the Oscillatory Energy 


In addition to the Hamiltonian Mt(y,y), the system for the coefficients of the mod- 
ulated Fourier expansion has another formally conserved quantity. This almost- 
invariant depends only on the oscillating part and is given by 


TO 人 = 一 各 >》 (人 7 做 . (6.12) 
0<|El<N 


This turns out to be close to the energy of the harmonic oscillator， 


T(z;2) 三 下 人 | 十 寺 o2llzall2 (6.13) 


Theorem 6.2， LV1mder 1Pe assu1IDHO1S OF7T1eore1z 9.1， 
Z(0y(0,yG) = ZTC(0),y(0O)+TOC 一 ) (6.14) 
TDyYG 的 ) = Tc 2bD)+TOCw (6.15) 


71e co1sta111s Sy1polized py @O ad1e idepende1t ofrw adtwm 了 0 区 寺 近 了 Dot 
depe11d o1 环 ，NV a1d 工 . 


Prooj For the vector 


y( 入 ) 玛 (et TDAy/ V+ 二 ..,e i》y/ 1 90 et . 2 0 





the definition (6.3) of 2f shows that ZL(y( 入 )) does not depend on 和 X. Its derivative 
with respect to 入 thus yields 


Q 


二 可 ;1 LE 和 AR 
0 = 去 VCGO) = > ， 请 ex(y)7 VCY(A)) ， 
0<|Kl< 
and putting 入 三 0 we obtain 
> ”， 饮 人 (TVUC) =0 (6.10) 
0<|El<N 
for all vectorsy 三 (VANAT+1L .00 .VAN 一， 


The proof of Theorem 6.2 1s now very Similar to that of Theorem 6.1. We mul- 
tiply the relation (6.4) with 一 zwk(y 一 人)7 instead of (全 7. Summing up yields， 
with the use of (0.10)， 


-io >》， (六 7T( 基 TD2) = Ory)， (6.17) 
0<|K|<N 


The time derivative of Z(y,y) of (6.12) equals 


Q RE 
TO 胃 = 一 和 六 大 +( 扩 7 619) 
0<|8|< 
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In the sums > K(o 人 22 and > KE(O 7V ,the terms with 有 and 一 上 can- 
cel. Hence, (6.17) and (6.18) together yield 


7(7 力 = Or ， 


which implies (6.14). 
With 从 = eiket( 演 十 wz = 和 oo 十 O(w-l,itfolows from the bounds 
of Theorem 3.1 that 
TO 了 ) = 2 村 二 Oo) 


On the other hand, using the arguments of the proof of Theorem 6.1, we have 





IT 划一 旨 天 十 抽检 十 直人 天 十 天上 OCT 三 234 检 +O(w-)， 





This proves the Second statement of the theorem. 











Theorem 6.2 implies that the oscillatory energy ls nearly conserved over long 
times: 


Theorem 6.3. 矿 1Pe so1ufiomn z(b off(2.1) stays 六 acompactsetjpor0 艾 大 <wAN， 
1je1m 
T(z(bzD) = TUz(0),2(0)TO(w- +Oo ) 


T7e comstamis Sy1polized py O ae 加 dependent ofw andt wii0 芭 过 去 WAN，DU 
depe1d on 忆 a1d 和 N. 


Prooj With a fixed 7 > 0, let yj denote the vector of the modulated Fourier ex- 
pansion termas that correspond to starting values (Z(7T),Z(77)). Fort = ( 十 0) 代 
with0 区 0 < 1,wehaveby (6.15) 


| 
| 
加 
3 

司 
ss 
要 
辐 


1 十 


We note 

ZUyj+H1(0) yjH1(00)) 一 TO0) ,70)) 二 O(w ~) ) 
because,， by the quasi-uniqueness of the coefhcient functions as stated by Theo- 
rem 3.1, we have yj+1(0) = yj(Z) 十 O(w-w>) and yj+1(00) = yT) 十 O(w- > )， 
and we have the bound (6.14) of Theorem 6.2. The same argument applies to 
T(y(07),y (07T) 一 IOy(0),y (0)). This yields the result. 
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In a different approach，Benettin，Galgani 儿 Qiorgilli (1987) use a_ Sequence 
of coordinate transformations from 了 Hamiltonian perturbation theory to Show that 了 
has only Small deviations over time intervals which grow exponentially with w, in 
the case of an analytic potential L/. By carefully tracing the dependence on N of 
the constants in the O(w-)-terms, near-conservation of 7 over exponentially long 
time intervals can be Shown also within the present framework of modulated Fourier 
expansions; See Cohen, Hairer & Lubich (2003). 


XIIH.0.3 Almost-Invariants ofthe Numerical Method 


We Show that the coeftcients of the modulated Fourier expanslion of the numerical 
solution have almost-invariants that are obtained Similarly to the above. We denote 


EE 一 A++L 一 L ,0 ，1 人 一 1 
一 ( 响 ) ) UVP UP ) 
/~ 一 N+L 一 1 -0 -1 N 一 1 
Z7 一 (2 0 


with 凡人 失 一 六 办 = 大人 and 几 胃 一 etiseto( 有 ,Where yp and 次 are the 
coefficients ofthe modulated Fourier expansion of Theorem 5.2. Similar to (0.3) we 
conslider the function 


1 77 CQ 
In = DG) + >， 而 避 (8 有 (Gy ， (6.19) 
s(a)=0 
where the sum is again taken overall7m > 1 andall multi-indices a = (Qi,. ,am) 


with 0 < |oj| < N for which s(a) = 2 a51 = 0. It then follows from (5.22)， 
multiplied with 力克 15, that the functions %f (区 satisfy 


ZE5j2C(D) 央 = 一 YATOOD)， (6.20) 


where C(AD) of (3$.16) denotes again the difference operator of the numerical 
method. The Similarity of these relations to (6.4) allows us to obtain almost- 
conserved quantities that are analogues of and Z above. 


The First Almost-Invariant， We multiply (6.20) by (7 7， and as in (6.9) we 
obtain 
> ( 狼 82COAD) 砍 十 元 (Cn) = OU ) ， 
| 丰 |<N 
Since we know bounds of the coefficient functions zf and of their derivatives from 
Theorem 35.2, we Switch to the quantities 2 and we get the equivalent relation 


d 
>》， ( 浙 一 pw 本 克 8 CORD 二 wo 间 允 二 元 砍 () =O(A).， (6.21) 
|E< 


We Shall show that the left-hand side is the total derivative of an expression that 
depends only on 有 and derivatives thereof. Consider first the term forK = 0. The 
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Symmetry of the numerical method enters at this very point in the way that the 
expression C(hD)y = 127 十 cih4y(4) 十 ceh6y(6) 十 . .contains only terms with 
derivatives of an even order Multiplied with 好 ,even-order derivatives of y give a 
total derivative: 








， d 1/. 机 攻 
87yCD 元 (gg 1) 一 5 3) .于 (1 DZ9WCHD) 平 


Go)ry@)， 
Thanks to the symmetry of the difference operator C(P 万 ) only expressions of this 
type appear in the term for 大 = 0 in (6.21), with z0 in the role of y. Similarly, we 
getforz 一 软 andz 一 zwith0< jpl < 


工 
2 





























Re 订 zC20D - Re 了 (和 xD .二 (zt-D)7ztHD 土工 (zD)7zO 
d 和 
RezrzerHD _ RedfarsD 二 (CeC-DjzrzCtD 干 工 (zjzrzg 
< ay 2 
Im 六 >(21+1 一 Im 也 汶 >z(020 5 训 2(24-0 二 (2 )7z0 1) 
过 
Imz7zt2l+2) 一 Im 多 人 一 计 z(20 DR | 
人 


Using the formulas ($.19) for C(PDD 十 认 Rwj,itis seen that the term forK in (6.21) 
has an asymptotic 几 -expansion with expressions of the above type as coefficients. 
The leftrhand Slide of (0.21) can therefore be written as the time derivative of a 
function 7tz[zpj( 切 which depends on the values at t of the coefficient function 
vector zh and its first N time derivatives. The relation (6.21) thus becomes 


元 和 [人 的 = OUN) . 


Together with the estimates of Theorem 3.2, this construction of 克 ， yields the fol- 
lowing result. 


Lemma 6.4， Lader 1Pe asSsU1D1iO118 of 7T1eore11a 9.2, 1je coe 访 cie1t zactio11y Z7 一 


(Ce ER 久 订 3 217， 记 0 宝 or ipe 1zodtulated Forrier expa118iO1 OF1Pe 7120- 
11IeE1iCGQL SOULUIO1 Sa11S 帮 
7 人 = 7 基 (0) + O(thy) (6.22) 


Jor0 艺 寺 所 了 .Moreoyer 


7 区] 介 一 要 |gao(b 必 十 c(Peo) 2w21 基 要 十 C(Bn 信 ) 上 +O(A2)，(6.23) 














Where a(Ppwo) = Sinc(hw)J0O(Pw)/AVp(Po)， 
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The Second Almost-Invariant. By the Same calculation as in the proof of Theo- 
rem 6.2 we obtain for 2 (y) ( 娘 ) of (6.19) 


上 二 >》， io (VC(y) 
0<|K|<NV 


It then follows from (6.20) that 


一 和 >》 8( 科 77512C(D) 外 = OU )， 
0<|E|<N 


Written in the > variables, this becomes 


一 和 >》 (十 j2CODD 二 翅 o 癌 过 = OO) (6.24) 
0<|E|<N 


As in (0.21)， the left-hand expression can be written as the time derivative of a 
function 友 [Zo]( 轨 which depends on the values at t of the function zj and its first 
AN derivatives: 


广 [zj 人 = ON) . 


Together with the estimates of Theorem 5.2 this yields the following result. 


Lemma 60.S， Lader 1pe asSU11Dtio1S of 7T1eore111 5.2, 1je coe 太 cient Jazzctions ZP OF 
1je 11odulated Foxrier expa118io1 Of 1pe 11U1IericaL SOLUIO1 SG1IS 


记 罗 的 = 多 基 (0) + OU) (6.25) 
Jor0 芯 二 所 了 .Moreoyer 


方 罗 (= ca(po)2w21 基 :(0 上 要 十 OU) ， (6.26) 





We1e da8aiz ol(Ppw) = Sinc(phw)g(Rw)/VpD(Po). 











Symplectic methods have WE) = sinc(6C) %(6) and hence ac(pw) = 1. To be 
able to also treat methods where o(Pw) can be small, we need to Sharpen the esti- 
mates of Lemma 6.3. Close Scrutiny of the equations (3.20) that determine the co- 
efficient functions of the modulated Fourier expansion, shows that the O(j12) term 
in (6.26) contains a factor %(juo)2, and that the OU ) term in (6.25) can be put in 
the form O(to(pwJxPpN) 二 OGANTm) with an arbitrary integer7mn > 0; cf (S.12). 
Assume now that 


0 is analytic with no real zeros other than integral multiples of 7r. (0.27) 


This condition ensures that |b(Po)|2 > chm for some mm 这 po satisfies (5$.10). 
Under the conditions of Theorem 5.2, in particular (4.1) and (35.10)， the Improved 
bounds of the remainder terms yield the following estimates for 友 三 肥 /c(Po): 
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甩 B 由 = 五 区 (0) 二 OU) (6.28) 
肪 区 的 = 2w| 革 :的 上 二 OO (6.29) 


Relationship with the Total and the Oscillatory Energy. The almost-invariants 





1 人、 二 国生 
到 = -六 ， 二 和 人 sj 全 (6.30) 


of the coefficient functions of the modulated Fourier expanslion are then close to the 
total energy 万 and the oscillatory energy 了 along the numerical solution (Zn Zn): 


Theorem 6.6. ZL1der 1pPe co1mditio1s of 7Tjeore11ay 5.2 a11d co11ditiom (6.21)， 


Itp[Zp] (区 < Tt7[Zz](0) 灶 O(AN) ) 7 [zz] ( 吉 一 及 [|ZP](0) 让 Oh) 
7 妨 [的 三 五 (oo2zm 二 ON， 有 环 区 的 = Tczn2zn) 二 OO) 
jpolds jpor0 芝 上 =7 凡 近 了 Te co1zlstan1s Sy11polized py O depe11d o7 环 ， N a1d 7 


Proof The uppertworelations follow directly from (6.22) and (6.28). Theorems 5.2 
and 3.3 Show 


wzZn1 一 (eet2j 1 人 十 e 2 人) 十 O(P) 
1 三 iu(e21( 一 etoj di 人 十 O(P) . 
With the identity ll 十 可 十 几 一 可 2 三 4 this implies 
Tc 一 2 121 的 此 十 OO ， 


A _ comparison with (6.29) then gives the stated relation between 7 and Z7. The 
relation between 妃 and tj is proved in the Same way, using in addition (0.23). 














XIU.7 Long-Time Near-Conservation of TIotal 
and Oscillatory Energy 


With the resujlts of the previous Section, we can now Show that the numerical method 
nearly preserves the total energy and the oscillatory energy 7 over time intervals 
of length Cj -AN+l, for any N for which the non-resonance condition ($.10) is 
Satisfied. Such aresult js due to Hairer 儿 Lublich (2000a). 

For convenience we Testate the asSsumptions: 


e the energy bound (2.3): 志 | 之 (0)| 十 志 ||42z(0) 上 2 < ; 
e the condition on the numerical solution: the values OZzn stay in a compact 
Subset of a domain on which the potential Z is Smooth; 


XII.7 Long-Time Near-Conservation of Total and Oscillatory Energy 511 


the conditions on the filter functions: W and 0 are even, real-analytic, and have 
no real zeros other than integral multiples of r; they satisfy WwW(0) = %(0) = 1 
and (4.1): 


(pw)l < CC sinc2( 寺 jhw) ， op < C2 sinc( 寺 pw)| ， 


人 
所 大 莹 ae 


the condition pw > co > 0 ; 
the non-resonance condition ($.10): for some N > 2， 


|sin(3Kjco)| > cvVP for 有 一 1 NV. 


Theorem 7.1.Z7pader tpe apove comditio11y, 1 加 e 11U11e1ical Solutio1 of(2.7) opiazmed 
Dy 1pe 111etpod (2.7) 六 (2.8) mi 到 (2.9) Satisjies 


刀 (zmw2) = 万 (zozo) 十 O( 


0 < 7 不 < 玫 -Y+L， 
天 二 半球 罗 同人 





71e co1sla1118 Sy11Dolized py @O are idependent of 7 凡 w Satihing 1pe dpove 
CO1di1io11S，DU1 depe11d om N aqa1d 11e co11sta118 1 加 e co12di1io118. 


Prooj The estimates of Theorem 6.6 hold uniformly over bounded intervals. We 
now apply those estimates repeatedly on intervals of length /, for modulated Fourier 
expansions corresponding to different starting values. As long as (Zn Zn) satisfies 
the bounded-energy condition (2.3) (possibly with alarger constant 已 ), Theorem 5.2 
gives a modulated Fourier expansion that corresponds to starting values (Zn Zn). 
We denote the vector of coefficient functions of this expansion by Zn ( 旭 : 
Zr 一 (27 2 02) 

(omitting the notational dependence on 由 for simplicity). Because of the uniqueness， 
upto O(hX+l1),ofthe coefficient functions ofthe modulated Fourier expansion con- 
structed by ($.20), the following diagram commutes up to terms of size O(PANTT: 


(Zn 2n) 和 (zn(0) ,Zn(0)) 
numerical | 
method (Zn( 六 ;Zn (六 )) 
= (uptoOUN+D) 
(Zn+l Zn+l) < (zn+1(0) ,Zn+1(0)) 


The construction of the coefftcient functions via (3.20) Shows that also higher deriv- 
atives of zn at 琅 and znflat0differ by only O(AX+1I). From the above diagram 
and Theorem 6.6 we thus obtain 
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7[zaHi(0) = tm]( 二 OO 人) 
一 tj[zn](0) 十 O(PNT+DT) . 








Repeated use of this relation gives 
Ttj[z](0) = Tt [zol(0) 二 OAhNT+DT) . 


Moreover, by Theorem 6.0 the coefficient functions corresponding to the starting 
values (Zn zn) and (Zo,Zo) satisfy 


7tpz[zn](0) = 五 (Zn2n) 十 OP ， 
Itj[zo](0) 一 万 (zo， 2Z0) 十 O(P) & 


So we obtain 


万 (zn2Zn) 一 (zo,zo) Itp[zn](0) 一 ?tpz[zol(0) 二 O(P) 


= Oh>+) 二 O( ， 


which gives the desired bound for the deviation of the total energy along the numer- 
ical solution. The same argument applies to T(zn, Zn). 














The imposed bounds of w and 0 become important when jhw is close to an 
integral multiple of T. Are these conditions also sufficient to guarantee favourable 
energy behaviour uniformly in pw, arbitrarily close to multiples of rr? Unfortunately 
the answer ls negative (See Fig.2.3 to Fig.2.7). The analysis of method (2.7)-(2.9) 
for exact resonances hhw = IT with integer 7 Shows that stronger conditions 


lp(PoJl < Clsinc(po)l， ypo)gPoJl < Csinc2(joo) (7.2) 


are Tequired. Even this ls not sufficient for near-conservation of the total and the 
oScillatory energy for pw nearamultiple of T. For linear problems 


with a two-dimensional symmetric matrix 4 with aoo > 0, and with initial values 
Satisfying the bounded-energy condition (2.3), Hairer 儿 Lubich (2000a) Show that 
the numerical method conserves the total energy up to O(P) uniformly for all titmes 
and for all values of Auw, 让 and only 让 


(5) = sinc (6) 6) (7.3) 


There is mo method (2.7)-(2.8) which approximately preserves the osScillatory energy 
7 uniformly for all APw in a fxed open interval that contains a multiple of 27r. 

In summary, the bad effect of step-Slze resonances on the energy behaviour of 
the method cannot be eliminated, but it can be consliderably mitigated by an appro- 
priate choice of the filter functions 几 and 0%. 
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XIIU.8 Energy Behaviour of the Stormer-Verlet 
Method 


The results of Sections XIH.S-XIIU.7 provide new insight into the energy behav- 
iour of the classical Stormer-VYerlet method. We Present in this Section weakened 
versions of results of Hairer & Lupich (2000b). 

In applications, the Stormer-Verlet method is typically used with step Sizes 玉 for 
which the product with the highest frequency w is in the range of linear Stability, but 
is bounded away from 0. For example, in Spatially discretized wave equations,， Acw 
is known as the CFL number, which is typically kept near 1. Values of Pw around 到 
are often used in molecular dynamics. In contrast, the backward error analysis of 
Chap.IX explains the long-time energy behaviour only for Acw 一 0. 

Consider now applying the Stormer-Verlet method to the nonlinear model prob- 
lem (2.1)-(2.3)， 


Zn 一 2 十 Zn 1 一 一 ji2D2zn 一 P2VU(zn) ， (8.1) 


with hnw < 2 for linear Stability. The method is made accessible to the analysis 
of 9Sections XII.3-XJII.7 by rewriting it as a trigonometric method (2.6) with a 
1IOdi1jied Jreqgxemcy: 


Zn 一 2cos(PDJ)zn 二 zl = 一 P2VU(zn)， (8.2) 
where 
亡 0 0 。 可 j 
人 2 一 ( 动 with sin(5pw) 一 二 jc . (8.3) 


The velocity approximation 


Im 十 1 一 光 m 一 1 


2 


Zn 一 


does hot correspond to the velocity approximation (2.11) of the trigonometric 
method, but this presents only a minor technical difticulty. We Show that the fol- 
lowing 1zodijied eergies are well conserved by the Stormer-VYerlet method: 


页 (2 王妃 (2 人 十 苔 | 作 1 
洒 二 | 5 
天 (22) 一 Tc) 十 让 al 1 一 了 (co 





Here 妃 and 7 are again the total and the oscillatory energy of the System (2.1) 
(defined with the original w, not with 必 ). 


Theorem 8.1，Zet ie S1O7P11e 产 We7Fief1method pe applied 1o 加 e propble11z (2.7)-(2.3) 
wiij a step Size 玫 jporwpicn0< co 和 hs 和 cl<2and| sin (二 懂 7 人)| > cv 
Jorg=1..NJorsomeN >2an1dc > 0.9UDpose jnrtper 1pat tpPe 11aerical 
SOLU1io Valley Zn Stay 1 07egion on WPichn all derrmvatiesy or are pox1nded 7Tje1， 
1je 11odijied emersgies alo118 1Pe 11U1eFical SOULio SG 万 
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万 *(Zzn2np) 三 万 *(zoZo) 十 O(7) 


0< 7j< ANT+L 8.5 
Fr(za2) = Fr(zo2o) 十 O( 癌 如 SS Re 





7T71e co1sta11s Sy11polized py O a1e indepedeatofm 帮 wwWiiD1Pe apove condifio1ny. 


Poof With the modified velocities z defined by 
2 sinc(PO) 2 一 ZnH1 一 Zn 1 


method (8.2) becomes a method (2.6) with (2.11), or equivalently (2.7)-(2.8),， with 
winstead of w and with V(6) = 6) = |. 

The condition 0 < co 和 jw < cl < 2implies |sin(3jo)| ca > 0 for 
下 三 1,2, and hence conditions (7.1) are trivially satisfied with Apw instead of Acww. 
We are thus in the position to apply Theorem 7.1, which yields 


厅 = 互 人 十 OU 
0 for 0<mh<7j-N+l， (8.0) 
7T(zn Zn) = T(zo,zZ0) 十 OP) 


所 

WA 
四 

S 

2 
| 


where 互 and 7 are defined in the same way as 刀 and 1, but with w in place of w. 
The components of the Stormer-Verlet velocities n and the modified velocities Z 
are ITelated by 


W 
了 0 和 一 一 /人 加 冰 人 
Zn0 一 2m0， Zn1 一 Sinc(Pw) Zn 1l 一 却 V 一 了 2w2 Zn 1 ， (8.7) 





So that 

克 /了 了 一 工 / 2 工 一 2 2 

(Zn Zn) 本 本 zz 十 50 zi 

二 > 
2 上 wa 二 了 zz (8.8) 
下 
全 2 

Similarly， 


下 ， 
豆 "(zm2o) 一 了 lzno 上 十 区 (za) 十 严 (zn2n) 


人 LU2 本 (8.9) 
一 刀 (zn21) 十 ( 扣 一 1 T(zny2Z1) ， 











and hence (8.6) yields the result. 





For fxed pw > co > 0 and 玉 一 0,the maximum deviation in the energy does 
not tend to 0, due to the highly oscillatory term 去 y|| 人 | in 刀 *(z; 他) and 产 (z 仑 ). 
We Show, however that 1i7ae averagesy of 妃 and 7 are nearly preserved over long 
time. For an arbitrary fxed 三 > 0, consider the averages over intervals of length 了 ， 
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1 








万 一 一 几 >》， (和 
15jI<TV2 全 1 
7 了， -= 生 二 >》 T(znh2n+ 让 ， 
二 


Theorem 8.2， Zader ie co1iditio11s of 7T1eore1a 8.71 11e 1i1e averagesy of ipe 1otal 
G1C 1jpe oscillatory ee18y al1o118 1Pe 11U11eFicaL SOULiomn sais 放 
吉 。 三 天 ,全 次 册 


区 四 0<7j< PNT+L 8.11 
0 





771e co1sta11s Sy11Dolized py O a1e ipndepedeatofm 帮 w wiiD1Pe apove condifio1ny. 


Proojf We Show 


人 
Zn = 区 (Zn Zn) 21 十 7 (Zn Znm) 起 O(P) 
加 1 7 (8.12) 


which implies the result by Theorem 8.1. Consider the modulated Fourier expan- 
Sions of Zn and Z1 fort = 7 几 in abounded interval. Theorem 5.3 shows that 


2 1 兰 话 (e 直 1 人 一 ee ( 鸭 ) 十 O(P) ， 二 一 7 几 ， 


with 过 1 ( 娘 from the modulated Fourier expansion of Theorem S$.2 (with w instead 
of w). With (8.7) it follows that 


Zi1 二 和 0VI1 一 2w2 (ei 直 1 一 e 光 1 十 O(P ， 


and therefore, recalling the definition of 了， 


1 2w 
是 (22 要 一 2Reeze 有 (6 人 十 OU ， 


Theorems 3.2 and 9.3 yield 
222| 站 :的 | = Tcznz 人 十 O( 
and hence, by (8.8)， 





2o22 的 性 = 天 (zzn) 二 O(CO 
A partial summation shows that the time average Over the highly oscillatory terms 
e2wtw2o1(b2 is O(A) .This fnally gives 
oa 二 OO 人 
元 也) 三 一 三 (ZnZn 
让 全 1 十 7 
171<TV/2 


Taking the time averages in the expresSsions of the definition (8.4) of 万 ”and 六 then 
yields (8.12). 
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50 100 150 50 100 150 


Fig. 8.1. ITotal energies (lefb and their predicted averages (right) for the Stormer-Verlet 
method and for two different initial values, with w 一 50 and 彤 such that hiw 一 0.8 


Figure 8.1 illustrates the above result. It Shows the total energy 所 for two dif- 
ferent injtial values on the left, and the averages as predicted by the expression on 
the right-hand side of (8.12) on the right picture. The initial values are as in Chap. 工 
with the exception of zl1(0) and 211(0). We take zl1(0) = V2/w, zi1(0) =0 
for one set of initial values and zi1,1(0) = 0, Za1(0) 三 V2 for the other The total 
energies at the initial values are 2.00240032 and 2, respectively. 


XIIU.9 Systems with Several Constant Frequencies 


This section Studies the conservation of invariants and almost-invariants along nu- 
Imerical approximations of an extension of (2.1) to Systems with the Hamiltonian 
function 


工 主 
五 pg= 5 Mip+T5E0 .49+IOg (9.1) 


222 4 
with a positive definite constant matrix AM and a positive Semi-definite constant 
matrix 4. With the Cholesky decomposition M = 也 Z7 and the canonical transfor- 
mation 方 一 了 -1D, GT 一 ZL79 we obtain aHamiltonian where the mass matrix is the 
identity matrix and 4 is transformedto 4 = 了 -147 开 .Diagonalizing4 王 Q4QL 
and transforming to 2Z = 三 QI then yields a Hamiltonian of the form (we omit the 
tilde on V(zZ) = UV(dq) and 五 (z,2) = 万 (D,9)) 


人 2 
SA 和 
Ce) 动 = 了 2 人 (|5 虹 + 写 lm 要 ) + GO) 
JJ=0 
where zZ 三 (20,21), .2Zp) with Z7 GE RS ,No = 0, and XJ >0for7>1areall 
distinct. After rescaling s we may assume Xi) 之 1 for7 一 |. .，[. 


Following Cohen, Hairer 久 Lubich (2004) we extend the results of the previous 
Sections to the multi-frequency case 上 > 1. Modulated Fourier expansions are again 
the basic analytical tool. A new aspect is possible resonance among the 入 7. 
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XIII.9.1 Oscillatory Energies and Resonances 


The equations of motion for the Hamiltonian System (9.2) can be written as the 
System of second-order differential equations 


艺 一 一 0D27 十 g(z)， (9.3) 


where 1 = diag(wj 丰 ) with the frequencies wj = Aj/s and g(Z) = 一 VU(z). As 
Suitable numerical methods we consider again the class of trigonometric integrators 
studied in Sect. XII.2, (2.6) with (2.11), with filter functions and . 

We are interested in the long-time near-conservation of the total energy 万 (zz,2) 
and the oscillatory energies 


7 ， 
(zc 划 = 了 (5 有 + 芝 lzzP) for 7>1 GO 


or Suitable linear combinations thereof. Benettin, Galgani 儿 Giorgilli (1989) have 
Shown that the quantities 


汪 
厂 (z, 划 一 达 总 万 (z, 刘 (9.5) 


are approximately preserved along every bounded solution of the Hamiltonian SyS- 
tem that has a total energy bounded independently of s, on exponentially long time 
intervals of size O(ec/=) the potential DT(z) is analytic and = (1 ,He) 
orthogonal to the 7eso1a1zce 11Odu1e 


A = 人 KEZL:HIANI 十 .十 ieXe=0}， (9.6) 


这 a diophantine non-resonance condition holds outside 人 4. (Cf. also Sect. XIII.9.4 
below.) 

Since/ 三 入 is orthogonal to 人 4, the total oscillatory energy 民 )-- 7(zZ,Z) of 
the System is approximately preserved independently of the resonance module 人 4. 
Subtracting this expression from the total energy (1.7), we See that also the 710o0 妨 
e11e18y 


天 (z, 罗 = 下 aol2 -Cr(z) GO7) 


is approximately preserved. With an s-independentbound of the total energy 五 (z, 2) 
wehave 2j = O(e) for7y = 1 sothat 及 (Z,2) is close to the Hamiltonian 
of the reduced System in which all oscillatory degrees of freedom are taken out， 
万 0o(zo,zo) 三 下 |2o||2 十 UL(zo,0,..….,0). 


了 xample 9.1. To illustrate the conservation of the various energies, we consider a 
Hamiltonian (1.7) withZ = 3, 入 = (1 V2,2) and we assume that the dimensions of 
27 are all 1 with the exception ofthat of Z1 三 (Z11,2Z12) Which is 2. The resonance 
module is then AM = {(5;0,ka); 1 十 2k3 = 0}. Wetake s- 1 一 w=70,the 
potential 
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Fig. 9.1. Oscillatory energies of the individual components (the frequencies Xijw 三 Xj/s 
are indicated) and the sum 六 十 7a of the oscillatory energies corresponding to the resonant 
frequencies w and 2w 


TU(z) = (0.05 二 zi1 十 2Z12 十 Za 十 2.5273) 十 2 1 十 2 (9.8) 


and Z(0) = (1,0.3s,0.8s, 一 1.1s,0.7s), zz(0) = (一 0.2,0.6,0.7, 一 0.9,0.8) as ini- 
tial values. We consider Ju for/ = (1,0,2) and = (0,V2,0), which are both 
orthogonal to 人 4. Im Fig. 9.1 we plot the oscillatory energies for the individual com- 
ponents of the System. The corresponding frequencies are attached to the curves. 
We also plot the sum 嫉 十 73s ofthe three oscillatory energiles corresponding to the 
Tesonant frequencies 1/s and 2/s. We see that 九 十 73 as Well as 72 (which are 刀 
for the above two vectors / 上 人 1) are well conserved over long times up to Small 
oscillations of size O(s). There is an energy exchange between the two components 
corresponding to the Same frequency 1/s, and on alarger Scale an energy exchange 
between 1 and 73. 


Numerical Experiment. As a first method we take (2.6) with 0%(6) = 1 and (E) 三 
sinc(e), and we apply it with large step Sizes so that jw = 几 /s takes the values 1, 2， 
4,and8. Figure 9.2 Shows the various oScillatory energies which can be compared to 
the exact values in Fig. 9.1. For all step Sizes, the oscillatory energy corresponding to 
the frequency V2w and the sum 万 十 妨 are well conserved on long time intervals. 
Oscillations in these expressions increase with 几 . The energy exchange between 
TesSonant frequencies ls close to that of the exact Solution. We have not plotted the 
total energy 歹 (Zzn, Zn) nor the Smooth energy 玫 (Znzn) of (9.7). Both are well 
Conserved over long times. 

We repeat this experiment with the method where (上 ) = 1 and WwW(C) = 
sinc2(e/2) (Fig. 9.3). Only the oscillatory energy corresponding to V2w is approx- 
imately conserved over long times. Neither the expresslion 嫉 十 13 nor the total 
energy (not Shown) are conserved. The smooth energy 开 (zn;, zn) is, however, well 
conserved. 

Figure 9.4 shows the corresponding result for the method with %(6) = sinc(6) 
and (5) = sinc(6)0(6). The oscillatory energy for V2w and also 万 十 有 are well 
conserved. However the energy exchange between the resonant frequencies ls not 
Correctly reproduced. 
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Fig. 9.2. Oscillatory energies as in Fig.9.1 along the numerical solution of (2.6) with (6) 三 
1andw(6) = sinc(6) 
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Fig. 9.3. Oscillatory energies as in Fig.9.1 along the numerical solution of (2.6) with 0%(6) 三 
1 andW(e) = sinc"(E/]/2) 
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Fig. 9.4. Oscillatory energies as in Fig.9.1 along the numerical solution of (2.6) with 0%(5) 三 
sinc(6) and W(S) = sinc(S)%(6) 


XIII.9.2 Multi-Frequency Modulated Fourier 了 xpansions 


The above numerical phenomena can be understood with a multi-frequency version 
ofthe modulated Fourier expansions studied in the previous chapter We only outline 
the derivation and properties，Since they are in large parts Similar to the Single- 
frequency case. More details can be found in Cohen, Hairer 多 Lubich (2004). We 
asSsume conditions that extend those of the previous Sections: 


e The energy of the initial values is bounded independently of <， 


了 2(0) 上 2 + 2z(O)12 < 互 . 0.9) 
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e Thenumerical solution values 2zn stay in acompact subset of a domain on which 
the potential L is smooth. 

We impose a lower bound on the step size: P/s > co > 0. 

We assume the numerical non-resonance condition 





几 
sin( 二 居 | > cvJ forallieZe\Mwihnlil<N， 0910) 
for someN >2andc> 0. 
e For the filter functions we assume that for 6 三]Aj/s (7 三 1 0)， 
Jp(5) < Cn sinc (二 67) ， 
le(6)| < Ca |sinc( 直 67)|， (9.1D 


12(S 放 么 Cs |sinc(S5) OCS 诈 ， 


The conditions on the filter functions are Somewhat stronger than necesSary, but they 
facilitate the presentation in the following. 

For a given vector 入 = (和 1 ...,Xe) and for the resonance module AM defined 
by (9.6), we let KC be a set of representatives of the equivalence classes in ZL /人 4 
which are chosen such that for each KE K the sum |K| = | 如 | 十 .十 |el is minimal 
in the equivalence class |] 三 天 十 人 ,and with ge 人 ,also 一 KE E 大 . We denote， 
for N of (6.3)， 


NeK:i<N Ar=NN{0 0) (9.12) 


The following multi-frequency version of Theorem XIII.3.2 establishes a modulated 
Fourier expansion for the numerical solution. 


Theorem 9.2， Co1zsider 加 e 11U1ierical solution of 1pe syste1a (9.3) py 加 e 171etpod 
(2.6) wii step yize 凡 Unader coadifio11g (9.9)(9.77) 1pe Pierical So1utiozz ad111i1y 
G11 CEXDG11S1O11 


Zr 二 的 十 》 eetc( 有 十 亚 .O(P2PY) (9.13) 
有 ENA* 


Wi 太 w = 和 /sa 加 pyjor0<t=7<Tanadsadnsatispig Rs > co > 0. 
THe 1iodulation factions 1o8getper wii Ga1 teir derimativesy (UPD 1o So11e GDPiNa1iy 
Jixed order) ae pox1ded D7y 


om = OU， 力 =O(c) 
交 人 (9.14) 


站 一 Oct) or 天 天 十 (人 











Jor7 =1 .Pere (1 = (0 10) tey7ipanityvector The 1ast es- 
1171ate 1olds also Jor 人 JoralpE EN Moreover 1jpe jnnction yy 1 7eal-valued 
ad 2 一 3 Jor al 有 EA*. THe comstants Symbolized py 1pe O-notafion a1e 
idepeadet of 凡 E ad Ai WE(9.710) Di tey depe1ld on 已 VCc ad 
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The proof extends that of Theorem XIIU.3.2. In terms of the difference operator 
of the method, Z(PD) = es2 一 2coshP 十 ethefunctionsy(b and zx(t) are 
constructed such thatb up to terms of size 歼 . O(PN+2)， 


1 77 CC 

LADJy = (gg)+ > 二 90(50)(53") 
s(a)~0 

也 (PP 十 访 有 ww) 大 二 7 区 >》 。 )(G@ 四 )(G2)2. 
SsS(a)~ 天 


Here, the sums on the right-hand Slide are over al mm > 1 and over multi-indices 
a = (al am)j with oj E AN for which the sum s(a) = 定 产 ; ay Satisfies 
the relation s(Q) ~ K, that is, s(Q) 一 E 人 4. The notation (z)” is Short for the 
ma-tuple ( 瑟 z21，... ) 瑟 zcm). 

A similar expansion to that for Zn exists also for the velocity approximation Zn， 
like in Theorem XII.3.3. As a consequence, the osScillatory energy (9.4) along the 
numerical solution takes the form, att 三 7mP 艾 了， 


万 (zi) = 2031z 和 的 上 2 上 + O(e). (9.15) 


With the first terms of the modulated Fourier expansion one proves, as in Theo- 
Tems XII.4.1 and XII.4.2, error bounds over bounded titme intervals which are of 
Second order in the positions and of first order in the velocities: 


lz 一 zolEC 到 2 一 2to<C， (9.10) 


where C is independent of s, 凡 and 风 with 7 六 科 人 and of bounds of solution 
derivatives， 


XIII.9.3 Almost-Invariants of the Modulation System 


Withyot =20 罗 =V 鸭 and 人 一 eetzke(bforkEAwhereyandzeare 
the modulation functions of Theorem 9.2, we denote 


了 二 (多 )keN， 民 二 (办 )keN， 


We introduce the extended potential 


1 
WU) = (BT > 二 DO0(B00)(G7)” ， (9.17) 
7 
s(oa)~0 
where the sum is again taken overall7m > 1andall multi-indices aw = (Qi,. ,am) 


with aj E A” for which s(a) = 2 7) ay E 人 .The functions (bthen satisfy 


ZIGj -2272D) 检 = 一 V_kUdJ)+E. ON)， (9.18) 
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where V_K denotes the gradient with respect to the variable 7 一 ^. This system has 
almost-invariants that are related to the Hamiltonian 媚 and the oscillatory energies 
刀 with / 上 人 4. 

The Energy-Type Almost-Invariant of the Modulation System. We multiply 
(9.18) by (W 外 7 and sum over KE N to obtain 


>》 (的 7E 一 本 127(AD) 央 十 2 = OU )， 
EEA/ 


Since we know bounds of the modulation functions zk and of their derivatives from 
Theorem 9.2, we Tewrite this relation in terms of the quantities z&: 


》 (一 记 :oc 人 7TE167 -27(D 十 iow) 大 十 和 2 =O(/). 
KEA/ 
(9.19) 


As in (0.21) we obtain that the left-hand side of (9.19) can be written as the time 
derivative of a function 7t*[z]( 坟 which depends on the values att ofthe modulation- 
function vector zand its first N time derivatives. The relation (9.19) thus becomes 


d 米 
二 允 四 (OO = O(z)， 


Together with the estimates of Theorem 9.2 this construction of 7t” yields the fol- 
lowing multi-frequency extension of Lemma XIII.6.4. 


Lemma 9.3，VU7der 1pe assu1aptionsy of 7Tjeoe111 9.2, 加 e 11odulation jpactiony Z 王 
(zz)kecAN/ OF 1pe 11U1ie1ical solution Satis 认 


?tl 回信 = 72(0) + Op ) (9.20) 
Jor0 艺 夺 所 了 .Moreoyvei att 三 jwe Pave 
Xtr 因 (= 豆 *(zw2) 十 OU (9.21) 


Where, wii ol) = Sinc(C)0(C)AVDC) and 5 = ANj/e， 


于 
于 (划一 厂 (人 十》 (ac() 一 ] 万 (人 . (9.22) 


JJ 一 1 


The Momentum-Type Almost-Invariants of the Modujlation System. The equa- 
tions (9.18) have further almost-invariants that result from invariance properties of 
the extended potential 4, similarly as the conservation of angular momentum results 
from an invariance of the potential Z in a mechanical System by Noether s theorem . 
ForE 了 4andy 一 (We)kcN Weset 


Su(TDJy = (eg)kew， 7TE 到 
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So that, by the multi-linearity of the derivative, the definition (9.17) yields 


eis(a)'AT 





WU(Su(r)y) = (Bog)+ > Tem(Goy0)(Gy)" ， (9.23) 


s(a)~0 


7 


开 4 上 Ad,thentherelation s(a) 人 0implies s(a): 风 = 0,andhence the expression 
(9.23) is Independent of 7. It therefore follows that 


0= 二 | wsn= 科 庆 站 (TY 


ad7Tj1r=0 NEA/ 


for all vectorsy 一 (kecN. INis not orthogonal to 人 M, some terms in the sum of 
(9.23) depend on 7. However, for these terms with s(a) E Af and s(a) . 几 天 0 we 
have |s(a)| 之 AM =minflfil : 0 和 关上 EAand 让 /上 上 AN,then|s(a)| 之 V 十 1 
The bounds (5.13) then yield 


yi ( 内 ) VCy) = 


KEA/ 


| O(ewY) forarbitrary 风 2 


O(esN+1) for 风 工人 AN 


for the vectory = y(b) as given by Theorem 9.2. Multiplying the relation (9.18) by 
(一 大 .AN) (本 and summing over E E 人 , we obtain with (9.24) that 


E 。 > (人 BOD) 因 =O()+OGe 
EEA/ 


The O(eY%-1) term is not present for 1 上 人 AN. Written in the z variables, this 
becomes 


(站 (2 的 7 到 -12L(AD 十 放下 = OUN) 上 Oexr1)， 
KEA/ 
(9.25) 


Asin (9.19), the left-hand expression turns outto be the time derivative of a function 
妃 回 ( 鸭 which depends on the values att of the functionzand its first V derivatives: 


所 丈 园 的 = OU +OGx-) 


Together with Theorem 9.2 this yields the following. 


Lemma 9.4.， Lader 1 加 e GaSSU1ID1io1s of 7T1eo1e11a 9.2，1pe 711odulatio1z actiony Z 
SQLS 帮 ， 
水 二 # V MI 一 1 
开 四 介 = 丈 四 (0) + OUPN) + OUtex-D) (9.20) 


Jpora1EReandjpor0<t 芭 7 Theysati 记 


胞 四 的 = 丈 四 (0) + OU ) (9.27) 
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Jor1L 上 人 Nand0 芝 1 所 了 Moreover alt 三 岂 几 
及 四 (人 三 到 (zy2n) 十 O(e)， (9.28) 


Were, againWi af) =sinc(E)0(E)AVpD(G)， 


地 
开光 = rr 父 ) 站 厂 C 训 ， 09.29) 


XIII.9.4 Long-Time Near-Conservation of TIotal and Oscillatory 
Energies 


With the proof of Theorem XII.7.1, the above two lemmas yield the following 
results from Cohen, Hairer 氏 Lubich (2004). 


Theorem 9.S，Vmader comditions (9.9 六 (2.77) 1pe 11aterical solution opiaized By 
111et1jod (2.6) mw1 碎 (2.17) satisjiey 1jor 万 a1d 了 大 dejired by (9.22) da10 (9.29)， 


天 (zzn) = 如"(zozo) 十 OU 


了 . Jor 0<7 下 区 万 -+1 
及 (ZnyZn) 二 12(20,Z0) 十 OU 





JorERCwp 了 NALAMN = 人 KE 人 :RS NTheconstants sympolized py O 
G1e indepe1idenat ofmu 太 E 和 AJSat1isHi128 加 e apove co1difioay DUL deped om TV a1d 
1je co11s1a1218 1 17e co11ditio118. 


Since 人 /三 入 is always orthogonal to 人 4 and to 人 MMN, the relation 
开 (z22) 三 古 " (zz 一 人 (2 二 ) 
for the Smooth energy (9.7) implies 
开 ( 人 (zi 三 开 (zozo) 二 OO for 0X 芭 六 芭 因 + (9.30) 


For ol() = 1 (or equivalently (5) = sinc(C)0%(6)) the modified energies 万 ”and 
了 让 are identical to the original energies 已 and 1 of (9.2) and (9.3). The condi- 
tion W 人 (CE) = sinc(C)o(e) is known to be equivalent to the symplecticity of the one- 
step method (Zn zn) Fr (Zn+Hlzn+l), but its appearance in the above theorem 
is caused by a different mechanism which ls not im any obvious way related to 
Symplecticity. Without this condition we sttl have the following result, which also 
considers the long-tme near-conservation of the individual oscillatory energies 姜 
for7 1 ...)L. 


Theorem 9.6，VPader comditions (9.9)(9.77) 1pe 11aterical solution opiaized By 
111etjod (2.6) Wi 碎 (2.77) Satisjiey 
互 (zzn) = 万 (zzo,zo) 十 O(M) 


. ， Dr 0<7? 妆 区 彤 .min(e 一 MT+L AN 
人 ) 
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Jor7 三 1 01M=minfll :0 和 有 EA 人 Theconstants Sy1zpolized by 
O da1e indepenadent ormu 见 s Ai SGQ1iSinag 1pe apove coadifioly PU depend om 
G1C 11e co11S1G118 171 加 e co11d111o118. 


For the non-Tesonant case 人 = {0} we have M = co and hence the length of 
the interval with energy conservation 1$s only restricted by (9.10). Notice that always 
M >3,andMWM=3onlyinthe case of al:2resonance among the 入 j. For al1:3 
TeSonance wehave M =4andin all other cases AM > 5. 


xplanation of the Numerical Experiment of Sect.XIIH.9.1. All numerical meth- 
ods in Figs. 9.2-9.4 Satisfy the conditions of Theorems 9.6 and 9.5 for the step SiZzes 
consldered. 

In Fig.9.2 we have the (symplectic) method (2.6) with 0%( 上 ) = 1 and (6) = 
sinc(6)，which has o(6) = 1 So that 巨 and 五 ", and Tu and 六 coincide. For all 
Step Slzes, the oscillatory energy 72 corresponding to the non-resonant frequency 
V2w and the sum 万 十 工 3 are Well conserved on long time intervals, in accordance 
with Theorem 9.5. The individual energies 7 and 713 corresponding to the resonant 
frequencies w = 1/s and 2/s are not preserved on the time scale considered here， 
cf. Fig. 9.1. In fact, Theorem 9.6 here yields only atime scale O(hnE-2). 

In Fig.9.3 we use the method with %(c) = 1 and W(6) = sinc2(e/2), for which 
ale) is not identical to 1, and hence 五 and 万 *, and 了 and 7 隐 do not coincide. 
The oscillatory energy 172 一 0 With = 〈0,1,0) 上 人 4, which corresponds 
to the non-resonant frequency V2w, is approximately conserved over long times. 
Since Theorem 9.3 only states that the 71odijed energies are well preserved, it ls 
not SUrprising that neither 六 十 13 nor the original total energy 巨 not shown in the 
fgure) are conserved. The modified enersgies ” and ol7 十 as73s (not Shown) are 
indeed well conserved, and So is the Smooth energy 玉 , in agreement with (9.30). 

Figure 9.4 Shows the result for the (Symplectic) method with %(C) = sinc(6) and 
(5) = sinc(6)%(6. Since oa(6) = 1 the oscillatory energy 7 for V2w and also 
嫉 十 1a are well conserved, in agreement with Theorem 9.3. However, the energy 
exchange between the resonant fredquencies ls not correctly reproduced. This behav- 
iour is not explained by Theorems 9.3 and 9.6, but it corresponds to the analysis 
in Sect.XIIH.4.2 which, for the Single-frequency case, explains the incorrect energy 
exchange of methods that do not satisfy w(C)%(c) = sinc(C) (and thus, of all sym- 
plectic methods (2.7)-(2.10), with the exception of the above method with 0%(E) 一 工 
and uW(C) = sinc(6). That analysis could be extended to the multi-frequency case 
considered here. 


We remark that the techniques of Sects.XII.9.2 and XII.9.3 can also be used 
to Study the energy error of the Stormer-Verlet method, as in Sect.XII.8; see The- 
orem 3.1 in Cohen, Hairer & Lubich (2004). The modujlated Fourier expanslion of 
the exact solution yields results on the near-preservation of the oscillatory energies 
along abounded exact solution: under the energy bound (9.9) and the non-resonance 
condition 

[KAI>cvs forkeZ\Awithlt<N (9.31) 


we have (See Theorem 6.1 in Cohen, Hairer & Lubich 2004) 
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灰 (zZ 的 ) = 灰 (z(0),2(0)) 二 TO(e) for0<t 芭 ev (9.32) 
forHwE 开 with 上 AN={EeEA 人 :IE < N} Wefurtherhave 


万 (z 的 ,之 的 ) = 万 (z(0),z(0))+O(e) for 0<t< smin(e fce ) 
(9.33) 
for7 =1 .2 with M =Imin{flbl :0 天 有 EA 人 4 


XIIHL.10 Systems with Non-Constant Mass Matrix 


The high frequencies of the linearized differential equation remain constant up to 
small deviations for mechanical systems with a Hamiltonian of the form 


如 (2,9) = 5p8 Mo(O)-1po 十 jpfMT ap 十 3pTR(O)p 二 志 红 4 二 (Cg) 
(10.1) 
with a symmetric positive definite matrix Wo(a), constant symmetric positive defi- 
nite matrices Ma and 41, asymmetric matrix 尺 (q) with 


尺 (q0， 0) 二 0， 


and a potential LV(q). All the functions are assumed to depend smoothly on 4d. 
Bounded energy then requires qi = O(sj, so that DR(op = O(s), but the deriva- 
tive of this term with respect to qi is @O(1). 

Asin (9.1), we may asSsume, after an appropriate canonical linear transformation 
based on a Cholesky decomposition of the mass matrix and a diagonalization of the 
resulting stiffness matrix, that the Hamiltonian is of the form 


《 2 
1 1 入 7 1 
感人 = 5-m+3 了 (olP+ 汪 lo)+57ROp+CO 
JJ 一 1 


(10.2) 
with distinct, constant Aj) 过 工 . 
The necessity for such a generalization results from the fact that oscillatory me- 
chanical Systems with near-constant frequencies in 2 or 3 Space dimensions typically 
cannot be put in the form (9.1), but in the more general form (10.1) or (10.2). 


上 xample 10.1 (Stiff Spring Pendulum). The motion of a mass point (of mass ]1) 
hanging on a massless stiff spring (with spring constant 1/s2) is described in polar 
coordinates 2Z1 一 7Sinw%, Z2 一 一 cos% by the Lagrangian with kinetic energy 
全 一 瑟 3 十 22) 一 了 (72 十 7202) and potential energy U = 记 (一 1)2 一 rcos p， 
With the coordinates qo = %, 91 = 了 一 1 andthe conjugate momenta Di; = 07 /00; 
this gives the Hamiltonian 


1 1 
吾 (p,g) = 了 (十 9) 20 十 三) 十 区 邓 一 (1 二)cosao， 


which is of the form (10.2). 
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Numerical methods for Systems (10.2) are studied by Cohen (2005). He splits 
the small term 到 DR(o)p from the principal terms of the Hamiltonian and proposes 
the following method, where 


1 匠 
天 (po,g) = 520 Mo(g) po 二 D(g)， 


Algorithm 10.2. 7. 4 Pa 太 step miip te yy11plectic EUier 1ietnod applied 1o 加 e 
SySte11a Wizp 瑟 azailiomia1 3 RR(0)p 8iveys 





用 工 
六 = 下 -2vo(3B)7R(on)i) 

(10.3) 
4 = 用 十 三 及 0) 六， 


2. 77eatizg 1pe oscillatory co1DPo1e1l1s of1pe va1iapies D Ga11d 9 Wi a 1rigo1o111e- 
17C 111etpod (2.7)(2.8) a1d te Slow co11Po1me1a1s Wiip te 91Om1ier- 了 Werefycpe111e 
yields (jporjJ =1 .landwi 太 wj 三 Ai/sand5 三 Poj 





mi = 曾 -3YoKC en) 

0 (VE 09) TVoEOSt2 90t] 

入 和 cos() 久 二 olsin(6) 针 一 与 W6)Vo 天 (ET 00n) 

羽生 一人 的 名 二 cos 六 一 2 (加 (YoKG8T 2 507) 
十 几 (5)VoKE(p8 2 001)， 

部 +1 = 枯 一 2VwR(0HT2 894) (10.4) 


mWjpere 瑟 = 思 OO) mwO2 =diag(wi 门 . 
3 了. 4HalFstep Wi 太 1je adjoinat sy1iDlectic EU1er 1Ietpod apPlied 1o tpe syste11a Wit 太 
再 amzzltomia7m 32D7 Rdq)P Siveys 








四 | 本 
pi = 名 时 了 Ve(iBrr07R()B 
(10.3) 
ont1 一 名 十 和 (on+)P 


The filter functions 内 ,Wo,Wi,b are again real-valued functions with W(0) = 三 
(0) = W(0) 0%(0) 1 that satisfy (2.9). The method is still symplectic 让 
and only 诺 (2.10) holds. Note that Step 2. of the algorithm is explicit 证 Mo(d) does 
not depend on 090. 

Cohen (2004, 2005) studies the modulated Fourier expanslion of this method and 
Shows that the long-time near-conservation of total and oscillatory energies as given 


by Theorem 9.0 remains valid also in this more general Situation . 
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上 xample 10.3 (Triatomic Molecule). The motion of a near-rigid triatomic mole- 
cule is described by a Hamiltonian System with a Hamiltonian (10.2). For Simplicity 
we fx the position of the central atom. We then have two stiff-spring pendulums 
Strongly coupled by another Spring. With angles and distances as Shown in Fig. 10.1， 
we use the position coordinates 1,91 = 了 1 一 1,p2,d2 一 72 一 with the conjugate 
momenta 7l,2D1,72,D2, TeSpectively. The Hamiltonian then reads 


1 区 
万 (rp,p,9qg) = j(G+o) 2 十 十 (1 十 2) ?7 十 友 ) 
1 2 2 2 2 
丰 训 (2 十 2 十 可 (22 一 9) | +U(p,9) (10.0) 


with a Spring constant 于 /s2 for connecting the two pendulums and an external 
potential /7 . With the canonical change of variables 


人 
d0 V2N\N1 1 2 / Tr2 V2N\1 1T/N\ 人 po/ 
the Hamailtonian takes the form (10.2): 


1 1 
豆 (D,9) = 5 二 叶 十 屯 十 03) 二 55( 时 十 星 十 o 9) 


2 (10.7) 
十 27R(Op+U(g) 


WwWith 
1 291 十 0 
4 (1 十 91)2 


1 292 十 02 
4 ( 工 十 92)2 











ZR(O)p = (po 一 p3) (po 十 ps) 
and L7(9) = UP1, 922, 91,92). 

For the water molecule the ratio between the frequencies of the bond angle and 
the bond lengths is a 久 0.2, according to Some popular models. In our numerical 
experiments, we observed good conservation of all the oscillatory energies and the 
total energy. More interesting phenomena occur in a near-Tesonance Situation. We 
consider a = 0.49 and <s = 0.01, no exterior potential (L = 0), and initial values 
d(0) = (0,s/2,s,a/s) and p(0) = (1.1,0.2, 一 0.8,1.3). mm Fig. 10.2 we apply 
the method of Algorithm 10.2 with step Sizes 丸 = 0.5s and 玫 = 2s and obtain 


万 2 
1042 


Fig. 10.1. Water molecule and reference configuration as gray Shadow 
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Fig. 10.2. Oscillatory energies and total energy for the method of Algorithm 10.2 
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0 50 100 0 50 100 
Fig. 10.3. Oscillatory energies and total energy for the Stormer-Verlet method 


numerical results that agree very well with a solution obtained with very Small step 
SiZes. For comparison we Show in Fig. 10.3 the results of the Stormer-VYerlet method 
with step Sizes 几 = 0.2s and 几 = 0.5s, for which the energy exchange is not 
Correct. For the reason explained in Sect. VI.3, (3.2)-(3.3), both methods are fully 
explicit for this problem. 


XIU.11L 玉 Xercises 


1. Show that the impulse method (with exact Solution of the fast System) reduces 
to Deufhard's method in the case of a quadratic potential 帮 (9) 三 二 097 .494. 
2. Show that a method (2.7)-(2.8) satisfying (2.9) 1s Symplectic 让 and only 让 


W(S) 一 sinc(60E) for 和 = jw 


3. The change of coordinates Zn = X(ROD)zn transforms (2.7)-(2.8) into a 
Imethod of identical form with ,WwWo,Wi replaced by YX, X 一 1 X 一 1W%0， 
X-1W1. Prove that, for jp satisfying sinc(Po)g(po)/p(po) > 0, it is pos- 
Sible to fnd X(Pw) such that the transformed method is symplectic. 

4. Prove that for infinitely differentiable functions 9g(t the solution of 芝 十 w27 一 
9 人 (bcan be written as 


Z(b 三 UV 人 十 cos(wbv 扩 十 sn(wbv( 坟 ， 
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where yy 人 (uv(b,u(b are given by asymptotic expansions in powers of w-!. 
ZL Use the variation-of-constants formula and apply repeated partial integra- 
tion. 


. Show that the recurrence relation en+1 一 2cos(h2)en 十 en-l 王 bn hasthe 


Solution 





en+1 三 一作 16eo 十 克 wel 十 >》， 全 
J=1 
with HH = sin(hD)-1sin((n 十 1)PD) (or the appropriate limit when 
sin( 几 (2) is not invertible). 


.Consider a Hamiltonian 瑟 (pR,pr,qR,Qqr) and let 


7tD,9) = 2 万 (DR,DT,dR, GT) 


for2p = DR 二 ?ID1, dg = qR 十 297. Prove that in the new variables D, dg the 
Hamiltonian System becomes 


9， = 
了 一 页 7 4 遍 d 一 而 刀 , 4) 


. Prove the following refinement of Theorem 6.3: along the solution z( 妨 of (2.1)， 


the modified oscillatory energy .J(z,2) 一 TU(z)2) 一 2Zf9i(z) satisfies 


J(z 人 ,2 人 (区 ) = J(z(0) 20) +O(w- 2 二 Oo) . 


. Deftine 瑟 (z, 划 = 万 (z,2 一 pzfogi(z)， J(z, 了 ) = J(z2 一 oztoi(z) with 


J(z,Z) of the previous exercise and with 


| 


sinc2( 寺 po) 
In the situation of Theorem 7.1, show that 


瑟 (zwi) = 瑟 (zozo) 二 O(J) 


区 for 0<7 帮 二] NT+L 
J(zm2n) = vJ(zozo) +O(O2) 
Notice that the total energy 巨 (zw,Zn) and the modified oscillatory energy 
.J(zn am) are conserved up to O(12) 让 p = 0,ie., 让 W%(6) = sinc2(36). This 
explains the excellent energy conservation of methods (A) and (D) in Figure 2.5 
away from resonances. 


. Generalizing the analysis of Sect. XIIU.8, study the energy behaviour of the im- 


pulse or averaged-force multiple time-stepping method of 9ect.VJIIL.4 with a 
fxed number N of Stormer-Verlet Substeps per Step, when the method is ap- 
plied to the model problem with pw bounded away from zero. 


Chapter XIV. 
Oscillatory Differential 上 Equations 
with Varying High Frequencies 


New aspects come into play when the high frequencies in an oscillatory System 
and their associated eligenspaces do not remain nearly constant, as in the previous 
chapter， but change with time or depend on the solution. We begin by Studying 
linear differential equations with a ttime-dependent skew-hermitian matrix and then 
turn to nonlinear oscillatory mechanical Systems with time- or Solution-dependent 
frequencies. Our analysis Uses canonical coordinate transforms that Separate Slow 
and fast motions and relate the fast oscillations to the Skew-hermitian linear case. For 
the numerical treatment we consider suitably constructed long-time-step methods 
(adiabatic integrators”) and multiple tme-stepping methods. 


XIV.1 Linear Systems with Time-Dependent 
Skew-Hermitian Matrix 


We conslider first-order linear differential equations with a Skew-hermitian matriX 
that changes Slowly compared to the rapid oscillations in the solution, a problem 
that has attracted much attention in quantum mechanics. We present a Sultable class 
of numerical methods, termed adiabatic integrators, which can take time steps that 
are Substantially larger than the almost-periods of the oscillations. 


XIV.1.1 Adiabatic Transformation and Adiabatic Invariants 


It comes from the greek aw0vaGarzXos“which cannot be crossed”. 


.We arrive by analogy to the“adiabatic Principle”used in Quantum 
and then Classical Mechanics. It is based upon the fact that the harmonic 
oSscillator (and other simple dynamical Systems as it was found later) Sub- 
mitted to Slow variations of its parameters modifies its energy but keeps 
its action (energy divided by frequency) constant. 


As we can see, the path from the word “adiabatic”used in thermodynam- 
ics to the above “adiabatic principle”is tortuous and our greek colleagues 
are certainly puzzled by sentences Such as“the changes in the adiabatic 
invariant due to [.…] crossing”which we shall use later. 

(J. Henrard 1993) 


We consider the linear differential equation 


532 XIV. Oscillatory Differential Equations with Varying High Frequencies 


8 = 工 2(99， 站 洒 


where 2G(b is a real skew-symmetric (or complex skew-hermitian) matrix-valued 
function with time derivatives bounded independently of the Small parameter <. 
In quantum dynamics such equations arise with 2( 旭 = 一 ;五 ( 扩 ,where the real 
Symmetric (or hermitian) matrix 瓦 (加 represents the quantum Hamiltonian opera- 
tor in a discrete-level Schrodinger equation. We will also encounter real equations 
of this type in the treatment of oscillatory classical mechanical Systems with time- 
dependent frequencies. Solutions oscillate with almost-periods ~ s，while the SyS- 
tem matrix changes on a Slower time Scale 下 工 . 


Transforming the Problem. We begin by looking for a time-dependent linear 
transformation 
JI 区 一 了 人 鸭 V)， (1.2) 


taking the System to the form 
， 1 
旋 四 一 S( 罗 7 with 5 一 于 7 十 歼 2TT 1 (1.3) 


which is chosen such that Se(tb is of smaller norm than the matrix 二 2 (b) of (1.1). 


Remark 1.1.A first idea is to freeze 2G 力 久 G over atime step and to choose the 
transformation 


歼 共 二 emp(--2.] yielding Se 伯 一 : ep(--2,] (5 的 = 及 ) exp{( 2 有 )， 
This matrix function 9-( 划 is highly oscillatory and bounded in norm by O(P/s) 
for 上 一 如 | 乏 几 让 2. = 和 (加 十 几 /2). Numerical integrators based on this trans- 
formation are given by Lawson (1907) and more recently by Hochbruck 多 Lubich 
(1999b),，JIserles (2002, 2004), and Degani 儿 Schiff (2003). Reasonable accuracy 
Still requires step sizes 玉 = O(s) in general; see also Exercise 3. In the above pa- 
pers this transfomation has, however, been put to good use in Situations where the 
time derivatives of the matrix in the differential equation have much Smaller norm 
than the matrix itself. 


Adiabatic Transformation。 In order to obtain a differential equation (1.3) with a 
uniformly bounded matrix 9-( 芭 we diagonalize 


2Z00 =UODiADU(G* 


with a real diagonal matrix 4(b 三 diag(Xj( 塘 ) and a unitary matrix UV 人 三 
(ui 人 加 ,un 人 (t) ofeigenvectors depending smoothly ont (possibly except where 
eigenvalues cross). We define 7)( 如 by the unitary adiapatic trazzs1jp7rmaatiomn 


00=exmp(-28(00)]U(D'yG0 wih 80=diag(bj()= / 40s) ds 
(1.4) 


XIV.1 Linear Systems with Time-Dependent Skew-Hermitian Matrix 533 


which represents the solution in a rotating frame of eigenvectors. Each component 
of 7( 切 is a coeffcient in the eigenbasis representation of y( 妇 rotated in the com- 
plex plane by the negative phase. 9uch transformations have been in use in quan- 
tum mechanics since the work of Born 儿 Fock (1928) on adiabatic invariants in 
Schrodinger equations, as discussed in the next paragraph. The transformation (1.4) 
yields a differential equation where the s-independent Skew-hermitian matriX 


is framed by oscillatory diagonal matrices: 


1D = exp(-2800) (GD exp(280) 7 (1.5) 
Numerical integrators for (1.1) based on the transformation to the differential equa- 
tion (1.3) with bounded, though highly oscillatory right-hand Side， are given by 
Jahnke & Lubich (2003) and Jahnke (2004a)j; See Sect. XIV.1.2. 


Adiabatic Invariants， Possibly after a time-dependent rephasing of the eigenvec- 
tors, 居 (一 ezax(ot( 坟 ,we can assume that zk(t) is orthogonal to wk 人 ( for 
all 上 (This is automatically satisfied 诺 Vis a real orthogonal matrix, as is the 
case for 2 区 = 一 瓦 雪 with a real symmetric matrix 巨人 ( 轨 .) We then have the 
matrix 全 一 (wj) 三 (UK) with zero diagonal. 

After integration of both sides of the differential equation (1.9) from 0 to 志 
partial integration of the terms on the right-hand side yields for 7 夭 KR (terms for 
7 三 K do not appear since 记 二 0) 


/ emp 人 (一 (bi 四 几 (9))) ui(s) (5) ds 


长 


2 NS) ts) 
一 (9) 一 各 人 )) | (1.6) 


; Q 2a07 (s)7k(5) 
(加 一 帮 ) 攻 攻略 





一 1E exp 人 ( 


At this point, suppose that the eigenvalues Xj (加 are, for all 盛 separated from each 
other by a positive distance 0 independent of s: 


|N 昌 一 Xe(b 5 forall 7 夭 大 (1.7) 


Then the reciprocals of their differences and the coupling matrix 歼 ( 轨 are bounded 
independently of s, as are their derivatives. TIogether with the boundedness of 7 as 
implied by (1.5), this Shows 


1( 坟 三 700) 二 O(e) for 工科 Const. (1.8) 


This result is a verslion of the quantum-adiabatic theorem of Born 儿 Fock (1928) 
Which states that the actions |7j|? (the energy in the 7th state，(77ju 刀 7ju)》 一 
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入 j|77 2, divided by the frequency 入 四 Temain approximately constant for tmes t 一 
O(1). Such functions 7T(V, 奴 that satisfy 7(y 坟 ) 尹 = TV(0),0) 二 OUe) for 上 = 
O(1) along every O(1)-bounded solution y(b ofthe differential equation, are called 
GdQiaDaic 10VG1IG115. 





Super-Adiabatic Transformations. Adiabatic invariants are obtained over longer 
time Scales by refining the transformation; See Lenard (1939) and Garrido (1904). 
Here we Show that the transformation matrix 7Y<- of (1.2) can be constructed Such 
that the matrix 95- in the transformed differential equation (1.3) is of size O(eX ). 
Let us make the ansatz of a unitary transformation matriX 

世 


T(N) 一 exp( 一 29) exp( 一 ii) exp(eX1) .exp(-icv-IGN)exp(sENXN) LU 


with real diagonal matrices @v (为 and complex skew-hermitian matrices Xn ( 轨 . We 
find that 9。 王 GT 十 Te72 is O(e) 让 and only 让 Xi and 41 := Di satisfy 





1 
一 (exp(ex) 14 exp( 一 <X1) 刘 4) 24 十 到 =O(e)， 
E 
or equivalently, if Xi and 41 solve the commnutator equation 
[4, XI 二 241 = 帮 


This is solved by setting ;41 equal to the diagonal of 克 and determining the off- 


(4D) 


diagonal entries 7 of Xi1 from the scalar equations 


iA7 一 AF) 列 忆 一 WO1j，， 了 天 刀 


which can be done as long as the eigenvalues are Separated. The diagonal of X1 is 
Set to Zero. Since 人 凡 is Skew-hermitian， So ls X1. Similarly we obtain for higher 
powers of s the equations 


[24, Xn] 十 14， 二 帮 ，1， 


Where the matrix 他 1 contains only previously constructed terms Up to index 即 一 工 
and derivatives up to order 7 and is Skew-hermitian because 9e is Skew-hermitian. 
In this way we obtain a uniltary transformation Such that 

mVN 一 TO satisfies Tv) = O(eN). 


We remark that the above construction of TY) ls analogous to transformations in 


也 amiltonian perturbation theory; cf. Sect. 和 .2. 

The differential equation (1.1) thus has adiabatic invariants over times O(e-Y) 
for arbitrary N > 1 and in fact even over exponentially long time intervals 
左 三 O(ec/ <) 让 the functions have a bounded analytic extension to a complex strip， 
as ls Shown by Joye 和 Pfister (1993) and Nenciu (1993). The leading term in the ex- 
ponentially small deviation of 必 ( 旭 |” in the optimally truncated super-adiabatic 
basis has been rigorously made explicit by Betz 儿 Teufel (2005a, 2005b), proving 
a conjecture by Berry (1990). 
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Avoided Crossing of Eigenvalues and Non-Adiabatic Transitions. Io illustrate 
the effects of a violation of the Separation condition (1.7), we consider the generic 
two-dimensional example studied by Zener (1932), with the matrix 


2 三 一 ? 芭 0 (1.9) 


which has the eigenvalues 士 1V 妈 十 02. The minimal distance of the eigenvalues is 
20 att =0.For0 = O(Vs) the adiabatic invariance (1.8) is no longer valid, and 
7 can Undergo O(1) changes in an O(0) neighbourhood of 上 = 0: a 7zo1-adiapatic 
1ra11sztio7 in physical terminology. The changes in the adiabatic invariant due to the 
avolided crossing of eigenvalues are illustrated in Fig.1.1 and can be explained as 
follows. 








1.0 上 











| | ， | | | 
一 .4 一 .2 .0 .2 .4 





Fig. 1.1. Non-adiabatic transition: |7i (加 |2 and |mz( 划 | as function oftfors = 0.01 and 
0 (increasing darkness) 


Near the avoided crossing, anew time scale7 = 上 /0 is appropriate. The decom- 
position G( = DT( 人 (DLL ofthe matrix yields 


rg = En = 人 0 


sin ac(7) coSs Q(T) 
人 


with aw(7) 一下 一 二 arctan(7). We introduce the rescaled matrices 


三 


辣 辣 二 / 0 


0 
2 d ~ 到 1 0 一 1 
Or) = (二 5 并 Cr) = 天 一 = 0. 丈 (. 
m =- (和 orjpgom -nn( 了 
Note that the entries of 作 (b have a sharp peak of height (20)-1 att = 0. The 


rescaled function TI(7) = 7(b is asolution ofthe differential equation 
2 


全 这 = exp ( -三 5) 丈 (r) exp (一 5 丈 仆 ， 
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For 62 < sand|lr| = |45| < 1 the matrix on the right-hand side is bounded of 
norm ~ 人 and has bounded derivatives with respect to 7. The function 7(7) therefore 
changes its value by an amount of size O(1) in the interval |7| 冬 1. We also note 
that any numerical integrator using piecewise polynomial approximations of 十 ( 萎 
and hence of 刺 (r) must take step sizes Ar = 1/6 娘 1lie, 几 < 娘 0.0Onthe 
other hand, the rescaling Shows that the number of time steps needed to resolve the 
non-adiabatic transition up to a specified accuracy is independent of 0. 


XIV.1.2 Adiabatic Integrators 


We discuss Symmetric long-time-step integrators for the rotating-frame differential 
equation (1.5) that describes SKkew-hermitian Systems in adiabatic variables. The 
construction follows Jahnke 儿 Lubich (2003) and Jahnke (2004a)j; see also Lorenz， 
Jahnke 久 Lubich (2005). 


First-Order Integrators。 We consider the differential equation (1.5) and integrate 
both Sides from 怒 to 怒 +1 三 如 十 由 : 


71( 刀 +1) 三 7( 刀 ) 十 矿 exp( 一 509) 本 (5s) exp( as)) 1(s)ds， (1.10) 


Where 厂 信 is an s-independent matrix，continuously differentiable in 妃 and the 
real diagonal matrix of phases GB(t is given as the integral of 4( 为 = qiag (入 )(). 
In the applications, 厂 (为 andG@(b) are not given explicitly, but need to be computed 
using numerical differentiation and integration, respectively. For Simplicity, we here 
ignore this approximation and consider 人 ,5, 4as given time-dependent functions. 

Since 7 and 他 have bounded derivatives, the following averaged version of the 
implicit midpoint rule has a local error of O(j2) uniformly in s:1 


你 1 7 1 
7n+l 三 71mn 十 / exp( 一 :5(s)) 全 (tn+172) exp( 28(5)) ds 本 (n+1 十 7n) 
tm 
(1.11) 
The problem then remains to compute the oscillatory integral. The integrand can be 
TewWriitten as 


忆 (@(5)) 昌 全 (如 +1/2)， 


Where e。 denotes the entrywise product of matrices and 
1 
BE(g= (oji)  wih eexp( 一 ( 仙 一 加 


With a linear phase approximation (of an error O(P2)) 


更 (如 +172 十 07) 入 屯 ( 加 +172) 十 01.4(tn+17/2)， 


1 Because of the oscillatory integrals, the local error is not O(ha) as might at first glance be 
expected for a symmetric method. 
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the integral js approximated by 
尺 五 ( 加 12)) (加 T12)e 全 ( 如 172) 


whereZ(b) is the matrix of integrated exponentials with entries (we omit the argu- 
ment 如 


1/2 20 及 国人 
率 ep 人 (二 WA)00=shmec( 关 (一 


The error in the integral approximation comes Solely from the linear phase approx- 
imation and is bounded by O(/ ， 生 ， ) 一 O(12) 证 the Ai are separated, because 
then the integral Zjj is of size CO 人 呈 ) .Wethus obtain the following averaged zzPlictt 
1aidpoint rule with a local error of O(j2) uniformly in s: 


工 
mtH 一 俩 十 有 (BE(bssaja))sT(nsijas 到 (bja)) (nt 十 轴 ) (L19) 


An analogue of the explicit midpoint rule is similarly constructed, and from the 
Magnus series (TIV.7.3) ofthe Solution we obtain the following averaged expome11ial 
1idpoimt rule, again with an O(72) local error uniformly in cs: 


mi 一 exp( 放 BEE(nHya)sI(ptias(nta) 加 (4.19) 


For skew-hermitian 歼 ( 人 ,also the matrix in (1.12) and (1.13) is skew-hermitian， 
and hence both of the above integrators preserve the Euclidean norm of 7 exactly. 
We Summarize the local error bounds for these methods under conditions that in- 
clude the case of an avoided crossing of eligenvalues. 


Theorem 1.2 (Local Error). SUPPose 1Patjort 如 所 寺 和 如 十 凡 a1d 011 7 太 


的 -AI>5 II<co， Il< 字 ，lwl< 号 


Wi 0 > 0. 7T1em 加 e Vocal error ofretpodsy (7.12) a1d (71.73) 1 poxaded by 
六 2 
Im 一 ?to 十 鸯 | 训 下 ol 
7T71e comstap1t C 1 idependent of 几 ,E; 0. 


Prooj Theresult is obtained with the arguments and approximation estimates given 
above, taking in addition account of the dependence on 0. 














The local error contains Smooth, non-oscillatory components which accumulate 
to a global error 7 一 9( 如 ) = O(P) on bounded intervals 这 the eigenvalues remain 
well separated. Using that in this case 7 is constant up to O(sj, this error bound 
can be improved to O(min{fes, 几 )). The integrators thus do not resolve the O(s) 
oscillations in 7 for large step Sizes 娘 > s, but like in Jahnke 多 Lubich (2003) 
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they can be combined with a (Symmetric and scaling-invariant) adaptive Step Slze 
strategy Such that the methods follow the non-adiabatic transltions through avoided 
crosSings of eligenvalues with Small steps and take large steps elsewhere. 

We here consider applying an ipteg1atli128 eyersipie step SiZe co1ltjoller as im 
Sect.VII.3.2 with the Step Size density function 


一 1/2 
5 的 一 ( 丈 的 性 + oo) 
for a parameter a that can be interpreted as the ratio of the accuracy parameter 
and the maximum admissible step Size. Choosing the Frobenius norm | 天 | = 
(trace TFTT)UV2，we then obtain the following version of Algorithm VIIL3.4， 
Where HUis the accuracy parameter and 
0 人 tt 芭 
GO= -2 = (POP+o9 aace(OOmTO) 


Set 20 = 1/c(t 如 ) and, for7 > 0， 





Zrn+1/2 二 2 炒 G(tn) 
Rn+172 至 HL/ zol1712 
如 +1 三 如 十 mnTI172 (1.14) 


7 77n+l by(1.12) or (1.13) with Step Size Rn +172 








2r+1 一 /2 十 六 Gin+1) 


We remark that the Schemes (1.12) and (1.13) can be modified such that they use 
evaluations at 如 and 如 +1 instead of 如 +1/2 (ExXercise 0). 

Applying the above algorithm with accuracy parameter /三 0.01 and a 三 0.1 
to the problem of Fig. 1.1 with se = 0.01 and 6 = 2-1,2-3,2-5,2-7 yields the step 
SlZze Sequences Shown in Fig. 1.2. In each case the error at the end-point 上 三 1 was 
between 0.5.10-3 and2 .10-3. 





10-1 


10 一 


1073 








1 村人 人 


8 0 47 0 2 4 0 8 10 


Fig. 1.2. Non-adiabatic transition: step Sizes as function of t for es = 0.01 and 9 = 
2- ,2-3,2-5,2- 7 (increasing darkness) 





| 
忆 


Second-Order Integrators. The O(s) oscillations in 7 are resolved with step sizes 
up to 丸 = O(Vs) for methods that give O(12) accuracy uniformly in s. Such 
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methods require a quadratic phase approximation, and one needs further terms ob- 
tained from reinserting 7(s) under the integral in (1.10) once again by the same 
formula, thus yielding terms with iterated integrals (this procedure is known as the 
Neu1aia11z or Pea1o or DySsom expansion in different communities, cf. Iserles 2004)， 
or by including the first commnutator in the Magjxs expansion (IV.7.3). Symmetric 
Second-order methods of both types are constructed by Jahnke (2004a). 

Care must be taken in computing the arising osScillatory integrals. Iserles (2004) 
proposes and analyses Filon quadrature (after Filon，1928)，which is applicable 
when the moments，i.e., the integrals over products of oscillatory exponentials 
and polynomials, are known analytically. This ls not the case, however for all of 
the integrals appearing in the Second-order methods. The alternative chosen by 
Jahnke (2004a)j ls to use an expansion technique based on partial integration. The 
idea can be jllustrated on an integral Such as 


1 eof co( 畏 ) 区 


with aw 夫 0. Partial integration that integrates the first factor and differentiates the 
Second factor yields a boundary term and again an integral of the Same type, but 
now with an additional factor OO 人 . 吾 ] 一 O( 由 ). Using this technique repeatedly 
in the oscillatory integrals appearing in the Second-order methods permits to ap- 
proximate all of them up to O(j13) as needed. We refer to Jahnke (2004a) for the 
precise formulation and error analysis of these Second-order methods，which are 
complicated to formulate, but do not require Substantially more computational worK 
than the first-order methods descriped above, and just the Same number of matrix 
evaluations. 








Higher-Order Integrators. Integrators of general order p > 1 are obtained with a 
phase approximation by polynomials of degree p and by including all terms of the 
Neumann or Magnus expansion for (1.3) with up to p-fold integrals. 


XIV.2 Mechanical Systems with Time-Dependent 
Frequencies 


We Study oscillatory mechanical Systems with explicitly time-dependent frequen- 
cies, where the time-dependent Hamiltonian is 


1 _ 1 
瓦 (p,g 王 了 D MX 人 的- P 十 04(Dg 十 (qd 加 (2.1) 


with a positive definite mass matrix AM(t) and a positive semi-definite stiffness ma- 
trix 4 (为 of constant rank whose derivatives are bounded independently of s. Such 
a Hamiltonian describes oscillations in a mechanical System that at the Same time 
exerts a driven motion on a Slower time Scale. We consider motions of bounded 
energy: 
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厅 (P(Dd(b 胃 芭 Const、 (2.2) 


We transform (2.1) to a more amenable form by a Series of linear time-dependent 
canonical coordinate transforms. The transformations turn the equations of motion 
into a form that approximately Separates the time Scales. This makes the problem 
more accessible to numerical discretization with large time steps and to the error 
analysis of multiple time-stepping methods applied directly to (2.1) im the originally 
given coordinates. 


XIV.2.1 Canonical Transformation to Adiabatic Variables 


By a series of canonical time-dependent linear transformations，which can all be 
done numerically with standard linear algebra routines，we now take the Hamil- 
tonian System (2.1) to a form from which adiabatic invariants can be read off and 
which will serve as the base camp for both the construction and error analysis of 
numerical methods. 

We introduce the energy 一 as the conjugate variable to time ti and extend the 
了 amiltonian to 


一 、 


瑟 (D, 书 ,9q 才 三 万 (D,9 力 十 五. (2.3) 


The canonical equations of motion are then (the gradient V refers only to 9) 


1 
六 王 一 三 40b9 一 VU (9 为 


9 MO 2 
along with 瑟 = 一 9 厅 /btandt = 1. 


Transforming the Mass Matrix into the Identity Matrix. We change variables 
Such that the mass matrix AM (bin the kinetic energy part is replaced by the identity. 
With a Smooth factorization 


MO = CC (2.4) 


e.g., from a Cholesky decomposition of M ( 划 , we transform to variables (9 为 by 
4 一 CGOD = 志 


Then, the conjugate momenta are given by (See Example VI.3.2) 


国 -0 9 鸭 -(e2 


With the transformed matrix 4 = CT74C, the Hamiltonian 万 (六 五 ,六 力 二 


一 、 


万 (D, 瑟 ,9q, 恕 inpthe new variables then takes the form (we omit all tildes) 


1 工 。 
瓦 (P, 忆 ,9 蚊 一 了 2 十 04(09 一 CC po 十 瑟 . 
(2.5) 
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Diagonajlizing the Stiffness Matrix. We diagonalize the matrix 4( 廊 ip (2.5)， 


至 0 0 并 
49=@g (0 ob)eg 0 
with the diagonal matrix 12( 坟 三 diag(wj( 为 ) of frequencies and an orthogonal 


matrix @C( 恕 , which depends smoothly on 上 ifthe frequencies remain separated. The 
matrix @G( 人 (如 can be obtained as the product 


T 0 
站 =Q@otlt 2.7 
Where the transformation with Co(b takes 4(b to the block-diagonal form 


49=oog(0 4)eoer 


and Q:(t diagonalizes 4,(t. The effect of an avoided crossing of frequencies is 
localized to Q (加 , which then can have large derivatives,， whereas those of Co 人 
remain moderately bounded. The transformation 


4=Q(OO2 = 
with the conjugate momenta 
分 = Q(O7p， 五 = 部 Q(DTp 十 五 
yields the Hamiltonian in the new variables ( 怀 ， 五 ,05 娘 as (we omit all hats) 


1 0 0 
WwWith 
(Koo Ko yy Drnm PrAmrrc 
本 | 有 站 人 


We decompose also 


according to the blocks in (2.0) and refer to go and al (po and pl) as the yiow and 
Jast positions (Slow and fast momenta)j, rfespectively. With the energy bound (2.2) 
we have 


D1=O)，4=O(e). (2.9) 
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Rescaling Positions and Momenta. We transform 
g 一 各， 和 1 一 s02071020， 革 一 
with the conjugate momenta 
总 一 Do， 六 一 sl20-12p1， 瑟 = 一 好 cV20-3/20p 十 忆 . 
In the new variables, the Hamiltonian becomes (we omit the haceks on all variables) 


了 厅 
元 号 人 (的 pl 十 元 qi 12(b9l (2.10) 


十 虹 开 (人 pp 十 TOT(d 人 十 吾 


1 
一 了 20 Do 十 


WwWith 
人 成 0 EL/2K0PDIL/2 
二 El/22-1/2K10 -LI2FKI1DI/2 于 二 102 
二 1712 _ To sl27o AN 了 0 
4 人 EL/ 也) 一 个 1/27 1 一 CQ 0 <L2PD-1/2 上 


Eliminating the Singular Block. We nextremovethe O( E-1/2) off-diagonal block 
in 玉 by the canonical transformation 


一 D1 一 一 页 十 sEL2D-12K89， do 一 0，t 上 = 


with the conjugate variables 


Q 
(Kol2-T 30 


责 =g， 丙 po+sL2KolD-120， 及 二 及 下 名 “的 款 


JIn these coordinates, the Hamiltonian takes the form (we omit all bars) 


1 全 1 
一 DT 人 一 07 2.11 
区 如 ( 轨 Pl 十 元 9 ( 蚊 @1 (C.11) 


+ OLDp 十 97S(Dg+D(T(bg 昌 十 至 


1 
瓦 = 了 00po 十 


with the lower block-triangular matriX 


Zoo0 0 
瑟 忆 一， 
(:， Zi11 】 


玫 00 0 
cl202-172(Kio 二 KK) DT2FKD212 十 二 2711 


and the Symmetric matrix 
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5-( 95o = 1/2501 
NeL25 10 2911 
where 


5oo = 一 KolKd， 
50l = 5 四 = 一 KooKolC-L2 


， Q 
Ko-U200U2KTD-12 十 了 321 人) - 亏 (Kol2-1Y3)， 
S11 一 CD-TL2( 一 KKol 一 天 各 民 太 十 天 人 Ko)O2-L2， 
We note that with the energy bound (2.2) we now have 


D= Ole)，g = O(eU3)， (2.12) 


了 quations of Motion. The differential equations now take the form 


Do j(p,9, 方 
do = po 二 go(9, 六 (2.13) 


ER 


with the functions bounded uniformly in <， 





( 旬 = ZGp 一 5009 一 TYUCT(D9,， (JR 


The matrix in the System is diagonalized by a constant unitary matrix: with 


17/7T 7 
r= 坊 (总 (2.14) 


人 0)- 下 0 痊 


Remark. Action-angle variables D17 一 VGJ COS 0 91 一 VG7 Sin 0 for the har- 
monic osScillators would now put the Hamiltonian into the form 互 三 zw 人  Q 十 
CG(a,0,po,qo, 加 ,which could be studied further using averaging techniques, that is， 
Using coordinate transforms that reduce the dependence on the angles in the Hamil- 
tonian; See Neishtadt (1984) for averaging out up to an exponentially Small remain- 
der in the case of a single high frequency. The first-order averaging transform might 
be done numerically (cf. the formulas in 9ect. XII2)，but the higher-order trans- 
forms involve increasingly higher derivatives of the functions involved and there- 
fore become impractical from the numerical viewpoint. For Systems with Several 
frequencies the averaging transforms require multi-dimensional integrals which are 


We have 
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expensive to compnute. For our numerical purposes we therefore continue differently， 
adapting the adiabatic transformation of 9ect. XIV.1.1. 


The System in Adiabatic Variables. Let the diagonal phase matrix be given as 


上 
2 0 
z0= 人 4(s)ads with 4 扩 = ( 0 
Our final transformation follows (1.4) and sets 


_ ~ 一 1/2 本 * | 人 P1 
站 一 上 exp 人 ( 5 人) 工 人 (2.16) 


The factor s-I1/2 is chosen for convenience so that (2.12) Implies 
7 一 O(1). (2.17) 


Weremark that up to now all transformations were invariant under rescaling s 一 0E 
and 4 人 一 o24( 坟 ,buthere we have chosen to give up this invariance in favour of 
(2.17). Note that 7 ls of the form 


一 1/2 : 
二 ec-L27* 网 三 六 | 2.18 
7 别人 ri (2.18) 





WwWith real vectors Tr, D Satisfying 


二 
不 十 ?0 一 en(-:/ 12(5) ds】 (P1 十 ?01). (2.19) 
to 


We denote the Inverse transform as 


人 


Together with e = 五 十 去 12( 鸭 Dl 十 去 df 2(bai andunaltered po, go,tthis yields 
a canonical transformation (Do, Ti; eqo,p, 旭 上 (Do,D1, 瑟 ,qo,q1 加 .The Hamil- 
tonian reads in these variables 


1 1 
囊 =5p0pPo 十 7Z(bp 十 5 S(D9+UTG)dD 十 e 


where on the right-hand side the components pl, 9l are expressed in terms of 7 and 
Ti 0 by (2.20) and (2.18). The equations of motion now become 


Do0 所 (P, 9 加 
do = po 十 go(d, 


这 这 -2exp( 一 -5 的) rs [0 


91(d, 训 
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with pl1, 91 expressed in terms of 7 by (2.20). Written out, the differential equations 
for po, go read 


Do 2 Zoop0 90000 TY VT(Toao， 如 四 5901G171 
一 78 (Ya 十 smQi7 一 YU(UTbao， 日 】 





0 = po+Loo+esL1Qar. (2.21) 


The matrix multiplying 7 after Substituting the expressions 万 and gil in the differ- 
ential equation for 7 becomes, apart from the oscillatory exponentials， 


* |/ 一 Z11 一 2911 
他 ”= 王 ER 0 三 永 (2.22) 


克 1 人 一 大 和 Z11 十 中 2E 人 人 


2AW7n+7 1- 2\ -5 9591 


which has a diagonal of size O(s). The equation for 7 then reads 
放 = exp( 一 800) (GD exp(28(00)7 
一 全 (Zaopo 二 Sioqo 十 到 VU(Tooo 十 smTQi， 日 ) (2.23) 


The matrix multiplying 7 is bounded independently of s, but highly oscillatory. Note 
that the coordinate transforms leading to (2.21), (2.23) are linear and can be carried 
out by standard numerical linear algebra routines. 


Adiabatic Invariants. We suppose that the eigenfrequencies wj (t remain separated 
and bounded away from 0: there are0 > 0 and c > 0 such that for any pair wj 人 
and wj 人 (加 with 天 开 (7 三 1 70],thelowerbounds 


lo 的 一 wk 5， wj 的 > (2.24) 


hold for all 4 under consideration. Under condition (2.24) the right-hand side 7 人 (如 
in the differential equation for 7 consists only of oscillatory terms, up to O(s). (No 
Smooth terms larger than @O(s) arise because the matrix 作 has a diagonal of size 
O(s).) It then follows by partial integration that 


小 7r(s5)qds 王 O(e) for 寺 和 Const,， (2.25) 
0 


1 加 三 7(0) 十 O(e) for 寺 和 Const. (2.26) 


万 = GO) (2.27) 
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are thus adiabatic invariants: 
万 ( 妇 三 万 (0) 十 O(s) for 寺 冬 Const (2.28) 


starting from a Hamiltonian System (2.1), where the mass matrix equals the identity 
and the stiffness matrix is already diagonal， we find that 7 is the action (energy 
divided by frequency) 








万 国人 (了 的 ?+ 到 四 9 中 ， 


oj 有 22 2c2 


Which for a constant frequency wj becomes a constant multiple of the oscillatory 
energy considered in 9ect. XIII.9. 


The Slow Limit System. As = 一 0, the evolution of the Slow variables Do, go is 
governed by the equations 


Do = 一 Zoo(po 一 Soo(Dao 一 (DYVU(CT(Dao) 
如 = po 二 Zoo(b7 go (2.29) 


which is the system with the time-dependent Hamiltonian 


工 
互 o(o, qo, 电 一 了 DO Po 十 9 Zoo( 罗 po 十 5 00 Soo( 扩 ao 十 D(To(bgo, 汶 . 
We conclude this subsection with a Simple illustration of the above procedure. 


上 xample 2.1 (Harmonic oscillator with slowly varying frequency). For the scalar 
Second-order differential equation 


(7 
E2 





4=(0， 


where w(t is bounded away from 0 and has a derivative bounded independently 
of s, the above transformations Simplify considerably. The Hamiltonian in the orig- 
inal variables ls already of the form 


lw 人 ， 
2 c2 d ， 





1 2 
巡 三 5 十 


and hence the first two transformations are not needed at all, and there are no Slow 
variables po, go. The rescaling transformation yields the Hamiltonian (2.10) in the 


要 人 的 人 人 
(tt (tt 1w(t 

石 一 2 十 7 和 浊 Dd. 
2 了 2 2w 人 (加 


With the adiabatic transformation (2.19) we thus represent the Solution as 





+ 一 exp 人 人 / dsjcGg 
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where5 三 和 十 ?0 Solves the differential equation 


t0=-2 en( 于 /oojcg 


万 如 





and satisfies C( 轨 一 C(to)(1L+O(e)) fort=O(1). (nthe above notation, we have 
刀 一 上 s -2((， 0C)7.) The action 





7 的 = 二 (了 092 十 汪 呈 -o()9) 


9 


ls an adiabatic invariant. 


XIV.2.2 Adiabatic Integrators 


ASimple long-time-step integrator for the oscillatory mechanical System with time- 
dependent Hamiltonian (2.1) now reads as follows: 


- Solve the slow limit System (2.29) for po, qo, e.g., by the Stormer-Verlet method. 
-- 下 eep the adiabatic variable 7 constant at its initial value. 


Under the condition of bounded energy (2.1) and the frequency Separation condition 
(2.24), the errorin 7 is then O(s) over intervals 上 < Const. by (2.26). The difference 
between the solutions of (2.21) and the limit equation (2.29) is bounded by @O(e?) 
for 上 入 Const., as can be Shown by forming the difference of the equations, inte- 
grating, estimating the integral of the extra terms by O(e2) using (2.26) and partial 
integration, and applying the Gronwall inequality. In the original variables D, q of 
(2.1) this yields an error Of(s2) in the positions and O(s) in the momenta. 
More refined integrators are needed for two Independent reasons: 


1. to keep control of 7 on Subintervals where the frequencies are not well separated 
and where 7 may thus deviate from its near-constant value; 
2. to obtain higher order of approximation on intervals with separated frequencies. 


We Simplify the following presentation by assuming that the potential L is quadratic: 
工 


with asymmetric matrix C(t) depending smoothly on 上 We leave the required modi- 
fications for general LU to the interested reader. Alternatively, the method with 过 三 0 
can be used in the splitting approach of Sect. XIV.2.3 below. 

An adiabatic integrator as described in Sect.XIV.1.2 can be extended to (2.23) 
and combined with a Symmetric splitting between the weakly coupled Systems 
(2.21) and (2.23): we begin with a Symplectic Euler half-step for po, 9qo (denoting 
the time levels by Superscripts)， 
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而 吉大 二 2 2 (Zoopi/ 十 (Soo 十 到 GT7b)98 
+e(Soa+ 玫 GT)oim) (2.30) 
的 = 加 十 2( 四 0 F Zoog0 十 =ZioQim 人 


Here the matrix functions 了 00, 了 10, 9500, 901 70, Tare evaluated at 刀 /2 三 如 十 
几 /2, and @j is the average of the oscillatory function Qi of (2.20) over the half- 
Step， 


2 t/2 
os 人 ou 
to 


obtained with a linear approximation of the phase 瑟 ( 芒 and analytic computation of 
the integral. We then make a full step for 7 with Eq. (2.23) like in (1.12)， 


1 
全 = 信 十 Ag) e7T。 厂 ) 5 5(7 思 十 矿 ) 
一 PP; (Zaopo +(5o 上 ITZGTb)al/ (2.31) 


Where again all matrix functions are evaluated at 妈 /2,， and Pi is the linear-phase 


approximation to the average 
1 人 
户 二 / 记 伯 示 
尺 js 


The matrix 玫 is as in (2.22), but with 911 replaced by 911 十 77 GT1. The step ls 
completed by ahalf-step for pzo, aqo with the adjoint symplectic Euler method: 


2D0 = 2 2 (Zoopi/2 二 十 (5oo 十 7 GT7b)dd 
+e(5o+ 玫 GT)otm) (2.32) 
0 = 0 + 有 丰 0 十 ED Qt 


Where the matrix functions are Still evaluated at 娘 /2，and QT approximates the 
average of Cl over the second half-step. 

We now give local error bounds for this integrator, under conditions that include 
the case of an avolided crossing of frequencies. 


Theorem 2.2. 9UPDPose 1Pat 1jpe Jpctionsy 态 (2.1) G1e 8S1100 矶 Ga11d 1je jegue1iciey 
Sa1ish (2.24) wii 1mipizral distance 0 > 0 Jort 如 区 二 近 加 十 见 anad tpe ortpogo- 
1G1 Ia11ix Qt or (2.7) wjicp diago1ajizes the OA Pa11t of 1je stress 
7117i jas derivatives bornaded py Q = O(0-D)，Q( = O(0-2). 4sszme 
Jrtper tpe eergy pou1d (2.2) Jortpe itial yalues. 7T1em 1pe 1ocal error of retpod 
(2.30)-(2.32) 1 Dot1ided Dy 
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了 一 po(to 十 站 = O(/52 十 O(e12 15 
0 一 (如 十 用 O(12]/5) 十 O(e12/6) 
人 一 It 十 站 = O(12/0). 


| 





771e co1sta11s Sy1ipolized py O do 1zof depe1d om =, 几 da12d 0. 

Proojf (a) Under the given conditions we have 
Koo=OU)，Kol = O()，Kio = OU)，Kii = O(0 )，and 
Koo = OU)，Kol = O(007D)，Kio = O(0)，FK = O(072)， 


This yields the bounds 
Zoo,Z10, 5oo, 51 = OU) 


and similarly for their derivatives, and 
Ti, So 910 一 O(00 -0)， i9o91o 一 O(0 2)， 


and hence also 
全 =O(07)， 玖 =O(0 一 2). 


So we have from the energy bound and the differential equation (2.23) for 7， 
1 一 OUD， 放 =O0 1) 
From the differential equations (2.21) for po, go we conclude 
加 =O(0- 上 +O(eo -加 =O(e67). 
(pb) To study the local error in 7, we integrate (2.23) from to to 如 十 彤 and compare 


with the corresponding term in (2.31): 


to 十 玉 
下 妇 人 芍 (Ziomo 十 (Sio 十 到 GTb)a】 人 寺 


to 
-APIi(Potaaai + (9o+ 下 GThttva)g7) 
= OU216) 


where we have used the above bounds and the error estimate for the linear phase 
approximation in the average of 万 (加 , cf. Sect. XIV.1.2， 


访 一 六 万 从 姓 = OU/ 


Combining this estimate with the error bound of the adiabatic midpoint rule for the 
homogeneous equation as given in Theorem 1.2 yields the stated error bound for 771. 


550 XIV. Oscillatory Differential Equations with Varying High Frequencies 


(c) The error bound for the components po, go comes about by combining error 
bounds for the Stormer-VYerlet method (which require the bounds for Do, do) and the 
estimates 


to 十 严 /2 
几 | 
/ cs(5ol 7 GTQa7 才 对 一 加 se(Sol + GT)(ya)Qim0 


to 
二 O(ej2162) 
and 
to 十 尹 /2 贞 
太 mw 和 eriolap)orm= Oleh2/9) 
to 


and the Same estimates for the Second half-step. See also Exercise 7 for a Similar 
Situation . 














In the case of well-separated eigenvalues, the global error on bounded time intervals 
is thus bounded by O(12) 十 O(he) in po,ao fort < Const,andbyO(P) in7.Inthe 
original variables p,q of (2.1), this then yields an error 


一 qt)=O03 上 +O)，m 一 pt) 王 O( 站 for 刀 近 Const. 


With an adaptive step Size Strategy as in Sect. XIV.1.2, it is again possible to follow 
7 through non-adiabatic transitions near avolded crossings of eigenvalues. 

A higher-order scheme with a global error of O(j2) in 7 - in the situation 
of separated eligenvalues - is obtained by replacing the upper line in (2.31) by a 
Second-order adiabatic integrator as discussed in Sect.XIV.1.2, leaving the last term 
in (2.31) unaltered. In the original variables ,qd of (2.1), the error ls then O(12) 
both in positions and (fast and slow) momenta. The error is even O(sj2) in the fast 
positions dl of (2.8), which oscillate with an amplitude O(E). We refer to Lorenz， 
Jahnke 儿 Lubich (2005) for the particular case of second-order differential equa- 
tionsl 十 s 4( 人 (bd = 0 with apositive definite matrix 4 (人 加. 


XIV.2.3 Error Analysis of the Impulse Method 


The transformation to adiabatic variables of Sect.XIV.2.1 also gives new inslght 
into the error behaviour of multiple time stepping methods such as the impulse or 
mollified impulse method discussed in Sections VJIU.4 and XII.1，which do not 
Use coordinate transforms in the method formulation. These methods are of interest 
when the eigendecompositions needed in adiabatic integrators are computationally 
more expensive than doing many Small steps with the fast subsystem,，and when 
evaluations of the potential force are So costly that the computational work for the 
fast Subsystem becomes irrelevant. We conslider the Splitting 


万 一 fast 十 万 slow 


of the Hamiltonian (2.3) with 
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1 加 1 
万 Past(D 媚 , 9 加 二 5D MO 2PD 十 2c2 0 4(Dd 十 耳 
有 Fev( 书 q 蚊 = (0 


The impulse method is given as the composition of the exact fows ofthe Subsystems 
(See Sections VJIII.4 and XIHIHI.1.3): 

有 天 这 op o 他， 
where weare interested in taking long time steps 玉 > ce (with apositive constant c). 
The equations of motion of the Slow Subsystem， 


妃 一 VV(d,， d 0， [ 0， 


are Solved trivially by 
去 用 葡 全 
了 一 2 一 VON 9 和 一 


JIn contrast, the fast Subsystem needs to be integrated approximately, e.g., by many 
Small substeps with the Stbrmer-Verlet method in the original variables (2, q) or by 
one step of the method (2.30)-(2.32) with G = 0 in adiabatic variables (2o, 9o, 77). 
In the following we ignore the error resulting from this additional approximation 
and study the Splitting method with exact fows. 

The error behaviour of this method can be understood with the help of the trans- 
formation to adiabatic variables of 9ect. XIV.2.1. The impulse method in the adia- 
batic variables po, go,m ls obtained py splitting the differential equations (2.21) and 
(2.23). The fast Subsystem is obtained by Simply putting Z = 0 in these equations， 
and the Slow Subsystem reads 


| 


Do -TVYU(Cbo +eme@imb， 如 =0 
放 = 一 下 到 YUChg 二 emQain 汐 
along with 一 0, so that the argument in all the matrices is frozen at the initial time. 


Here 访 (人 and Qi are again the highly oscillatory matrix functions of (2.20). 


Since QI 下 一 0 wehave Qi17 = Const., and therefore, in these variables the flow 
Slow 


772 jls the mapping given by 


信 刀 六 
加 = mo 一 5VYU(Cbmo+eTiQito， 负 = 


| 


父 大 
放 = 7 一 了 下 全 YU(CTog +eTiQi7t)， (2.33) 
where the matrices 70, 克 , 已 ,Qi are evaluated at 如 . In the impulse method, the 
above values are the starting values for a step with OP 基 全 which is followed by 
another application of 47 

Adisturbing feature in (2.33) is the appearance of the particular value 万 (to) of 


the highly oscillatory function instead of the average Pi as in (2.31). 
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We now consider the error propagation for 7 in the case of well-separated fre- 
duencies. Recall that the exact solution then satisfies 7 人 (加 = 7(0) 二 O(s) for 
寺 巡 Const. For ease of presentation we consider a constant step Size 用. 


Lemma 2.3. 4ssu1ae 1e e1lergy po11d (2.2) Jortpe initialyalues. 矿 1pe jegxemciey 
wj 7e1aain sepa1ated jjom eacjhn olpez 芒 en tpe Tesulf dfjier 7 stepsy Satisjiey Jor 
1 几 区 了 区 Co11S1， 


7 三 110 十 an 十 O(s)， (2.34) 
WAere 
1 
< 一 “各 (让 |. 
lenzlg Cn wii 人 央 人 的) | (2.35) 


Pioof We have 1 = 信 ( 刀 )，where 人 (人 ( 轴 solves the differential equation with 
impulses， 


2 2 
太一 一 5) 不 ( 23) An 5 
711 exp 人 ( eXp 0 717 07 
Here 歼 ( 人 is the matrix (2.22) appearing in (2.23), and 


7 从 三 一 开 人 的 (Zio(bpom( 蚊 十 Sol(bgon 信 ) 


With po (, go 人 (区 denoting the piecewise constant functions that take the values 
of the numerical solution. Further we have 


11j = 一 六 户 ( 帮 关 下 (的 蔷 CD 三 )go5 十 三 )Q1()77 太 )， 


the expression on the right-hand side of (2.33), and 0j is a Dirac impulse located 
at ti Itfollows that fort 上 一 mA， 


Jr 一 900 三 1 如 ) 一 7(0) 


一 1 om- 本 (5s) exp( 0) 11j(S) ds 十 7r(s) qds 十 an， 


where an ls the trapezoidal sum of the terms on the right-hand Side of (2.33): 
mr 二 一》 户 ( 押 大大 (的 VE 全)go5 二 sm 全)Qa 人 0 二 )， (2.36) 
JJ=0 


The prime on the sum indicates that the first and last term are taken with the factor . 
Using partial integration as in (1.6), we obtain 


1 so(-2a) 本 (s) exp( as)) 7j(s) ds 一 O(e)， 


and by partial integration as in (2.25)， 
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This shows (2.34). A partial summation in (2.36), Summing up the oscillatory termas 
已 (万 ) and differencing the smoother other terms, then yields (2.35). 














The size of k of (2.35$) depends on possible resonances between the Step Size and 
the frequencies, yielding between O(7) and O(1). For the error of the method we 
have the following. 


Theorem 2.4. 4ssu1ae ie energy pou1d (2.2) Jor 加 e ipifial valuesy. 矿 1pe jegxe1- 
cies cj (7e1aaina sepa1ated Jomaz eacjhn otper 切 en tpe error of ipe jptlse metpod 
afierm stepy with step Size 几 二 csyatisjiey 


DOm 一 D( 刀 ) 人 O(A) 
和 一 dz) = OO(en). 


71e co1sta111s Sy1iDolized py @O do 11of depe1d oj 2 天 0G110 7 WII 1 芭 Const. 


Pioof The error of size O(A) in 7 immediately implies an error of size O(A) in the 
actions 万 一 二 7 上 >, and an error of O(A) in the fast momenta pl and of O(sA) in 
the fast positions 9q1 of (2.9); recall the transformation (2.16) and the rescaling. Im 
the Slow components Do, qo the method is a perturped variant of the Stormer-Verlet 
method. The contribution of the perturbations sTi Qi7 to the error is of size O(sA). 
This is seen by applying the simple lemma below with y = (2o,qo) and 


ad 主 ( 3 ] ET1 ( 刀 )Qa( 如 )T7n 】 二 O(P2e) 


and using partial summation of the dn, Summing up the oscillatory terms Qi(t) 
and differencing the other termas. 














Lemma 2.S. ZetG1n(y) = yy 十 几 E(V) pe aome-step 1ietpod wjpeyre 丽 pay Zipscjiz 女 
CO11S1G11f 了 .Co1lyider 加 e 1ietpod a1zd a Pert1rpatio， 


ynm+L1 一 pyn) G11G +1 和 pr) 十 Qn， 


Wi1P te sa11re Sia11i118 valUesy 0 三 0. 7T1em 1pe dijjerenrce 1 poxnded py 


天 
站 
| 多 一 和 os<e 0<REX 1 2 
了 7 一 


Prooj The result follows from 


RE 也 一 工 
加 二 刀 》 (及 ( 防 ) 有 末 ()) 六 的 
5=0 














with the discrete Gronwall inequality. 
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XIV.2.4 Error Analysis of the Molliftied Impulse Method 


The problem with possible step-Slize Tesonances can be greatly alleviated by the 
mollifhed impulse method (see Sect. XII.1.4) where the potential CU(a,tbisreplaced 
by a modified potential V7(q,. A good choice is 





六 7 0 到 
re-=vUoeb wm 40=c09g(1 的)oorcor 
(2.37) 
with C and CO of (2.4) and (2.6), and 
. 必 | 和 ?25 
SO = sinc( = 2 = 可 人 exp( 二 二 2(0)) ds. 
Acalculation Shows that it rfeplaces (2.33) by 
全 九 六 
让 = Po 一人 YU(Cbm+emeito)， 负 =@ 
六 大 
放 = 7 一 了 FTVYU(CTDo 十 esTiQiny,to)， (2.38) 


with matrix functions evaluated at 10, where Pi ( 塘 and @il (bb are the linear-phase 
approximations to the average over the interval 上 夺 一 丈 , 上 十 有 of 己 and Qi,respec- 
tively， 


1 t 十 疡 


人 


1 t 十 刀 
oO0-s0o0=- 基 人 ou+og 


记功 二 S 鸭 疡 的 己 (s5)ds 二 OU) 


Therefore, (2.34) and (2.36) hold with the highly oscillatory 万 | (万 ) replaced by the 
averages P1 (万 ). Using a partial summation in (2.36) and noting that, for 上 三 ?7 挟 
Const.， 


二 OO =O(e)+O()， 








je 盖 ml=| /ae 


we obtain an estimate 
7 二 70 十 O(A) 


instead of the corresponding bound (2.34) with (2.3$). This eliminates the bad ef- 
fect of step Size resonances (large A) on the propagation in the fast variables over 
bounded time intervals 上 < Const. (though not on longer intervals, as we know from 
Chap.XIID). The more harmless effect of Step Size resonances on the Slow variables， 
as Visible in the term O(sk) in Theorem 2.4, is likewise reduced to O(s 由 .We thus 
obtain the following Improvement over the error bounds in Theorem 2.4. 
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Theorem 2.6. 4ssu1ae ie ene1gy pou1d (2.2) Jor 1je ipifial valuesy. 矿 j1je regue1- 
cies wj remain sepa1ated jjomm each other 1pen tpe error of the abpove 1no1lified 
iDUlse 11etjpod dfier7 stepy with step Size 几 过 ceatisjiey 


Pr 一 pi) = OU 
dm 一 qd( 恕 ) 王 O(P2). 











771e co1sta111s Sy11Dolized py GO do 11of depe1d o1l 2 天 0G10 7 Wi TH 芭 Const. 





Adirect Implementation of this method requires just the same matrix decompo- 
Sitions that are needed for the integrators in adiabatic variables. It is then reasonable 
to Use one Step of the adiabatic integrator of Sect. XIV.2.2 for Solving the fast Sub- 
System over atime step. 

Analternative is to compute the average .4( 加 by small time steps from the linear 
differential equation with the Hamiltonian 瓦 fast，as formulated in Sect. XIII.1.4. 
The method described here then corresponds to (XIHUI.1.18) with c = 十. 


XIV.3 Mechanical Systems with solution-Dependent 
Frequencies 


We2 consider the Hamiltonian 
1 人 | 1 
刀 2g=5D MO D+C(9) + 三 Y(9) (3.1) 


with a strong potential s-2V(d) that penalizes some directions of motion. Analytical 
studies of this problem were done by Rubin 儿 Ungar (1937), Takens (1980), and 
Bornemann (1998). In an alternative approach to these works，we here describe a 
transformation of the problem to adiabatic variables. This gives new inslght into the 
solution behaviour and can be used as the starting point for the construction of long- 
time-step integrators. It also enables us to analysethe error of multiple tme-stepping 
methods. 


XIV.3.1 Constraining Potentials 


We consider the Hamiltonian (3.1), where M(d) is a symmetric positive definite 
mass matrix depending Smoothly on the positions g E 展 ",Z is a Smooth potential， 
and the constraining potential is assumed to Satisfy the following: 


2 This section was written in cooperation with Katina Lorenz (Doctoral Thesis， 
Univ. Tubingen, in preparation). 
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The Smooth function 站 : DCR" 一 及 at- 
tains its minimum value 0 on a wd-dimensional 
manifold )2 C 到"， 


7={foeDIVy =miny=0}， (3.2) 


In a nelghourhood of )，the potential 人 
ijS_ Strongly convex along directions non- 
tangential to ), that is, there exists Q > 0 
Such that for ge ,the Hessian V2V(d) Sat- 
isfies 





uU7V2V(ou > av)v (3.3) 


for all vectors v in the M(9)-orthogonal complement of the tangent space Tv. 
Welet 冯 三 即 一 gbethe number ofindependent constraints that locally describe 
the manifold )7. 


上 xample 3.1 (Chain of Stiff Springs). The position of Im 十 1 mass points in a 
plane, arranged in a chain connected by stiff springs with Spring constants a2 /cs2, is 
determined by the Cartesian coordinates of the first mass point and by 7 angles w; 
and the elongations di of the mm Springs. The constraining potential is 


7 


2 


并 


and the constraint manifold is described by al = ...= dm = 0 corresponding to 
non-elongated Springs. The frequencies of the Vibrations in Such a chain depend on 
the angles. 


In the above example we have, in the coordinates given by the angles and elon- 
gations, a potential Y of the form 


VY(O = 了 4(o)a GJ4 


for d = (qo,gi) E 了 月 4 x 下 mwith a positive definite matrix 4(qo). The manifold 
of constraints is here simply 7 = 及 2 x 0. As the following lemma shows, this is 
already the general Situation in Suitable local coordinates. 


Lemma 3.2，VU7der comditionsy (3.2 广 (3.3) 1ere exisf da S1100 妨 1ocal cjpna1z8ge oF 
coo1dipates dg 王 X(V) SUCP 1Pat 


1 


Y(g) = 了 由 4(o) 阅 or 9 一 X( 


Wi 三 (yo 位) near0 六 陈 4X 限 玫 ,wpeye 4(00) 六 asSymamzetric Positive dejizite 
7 X 770 1711G117X. 
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Pioof Im afirst step, we choose local coordinates dg = 岁 (Z) with z = (Zo,Z1) near 
0 in 了 及 4 x 月 岂 , such that dg = W(z) <E 炎 这 and only 让 zi = 0. In these coordinates， 
denoting Y(Zz) = V(q) ford = 风 (z), we then have 


六 (zo,0) = 0， VfY(zo,0) =0 


by (3.2), and 本 
4(zo) := V2 TY(zo,0) is positive definite 


by (3.3). We now change coordinates by the near-identity transformation 
Wo 三 2Z0， 纪 三 AZz)zi 


Where the real factor /(z) Cnear 1 for zl near 0) is to be chosen Such that 


一 


1 
了 红 4(yo)9 王立 (zo,Z1). 


Since the right-hand Side equals 


Do 


一 一 1 
VY(zo,zZz1i) 一 V(zo,0) 一 ztVV(zo;0) 三 5214(zo)za 十 7r(Z) 


with r(z) = O(zill?), the choice 





does the trick. 











Weremark that Lemma 3.2 could be obtained as a corollary to the Morse lemma， 
for which we refer to Abraham 儿 Marsden (1978) and Crouzeix 贸 Rappaz (1989). 

The change to the local coordinates z = (Zo, Z1) Such that (dg) 王 0 直 and only 
这 zl 一 0fora=W(z),is not numerically constructive from the mere knowledge 
of an expression for the potential ，. However in many Situations the manifold ?7 
can be described by constraints g(dq) = 0, and zl = 9 can then be extended to a full 
set of coordinates. The above transformation from 2z to y can be done numerically. 
In the usual way, the transformation dg = X(V) of the position coordinates extends 
to a canonical transformation by setting py 三 X(V)7p for the conjugate momenta; 
See EXample VI.3.2. 

Solutions of (3.1) are in general oscillatory with frequencies of size ~ < 一. 
There exist， however，Special Solutions having arbitrarily many time derivatives 
bounded independently of es, which for arbitrary NV > 1 stay O(ewv ) close to a man- 
ifold Yes that has a distance O(e) to V. See Lubich (1993),，where also implicit 
Runge-Kutta methods for the approximation of the smooth solutions are studied. In 
this Section we are, however interested in approximating general oscillatory Solu- 
tions of bounded energy. 
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XIV.3.2 Transformation to Adiabatic Variables 


We start from a Hamiltonian (3.1) in coordinates (2, q) where the constraining po- 
tential is already of the form (3.4) for d = (ao,aqi). We note that for a system of 
bounded energy, we then have 9q1 = O(s). 

We now perform a series of canonical transformations that take the Hamiltonian 
into a form that is better Suited for a direct numerical treatment and for the er- 
Ior analysis of multiple time-stepping methods. The transformations are Similar to 
those for the time-dependent case treated in Sect. XIV.2.1, but here they appear in a 
permuted order. 


Transforming the Stiffness Matrix into the Identity，We write the Cholesky de- 
composition of the stifftness matrix as 


4(q) = C(qo) 一 C(oo) 
and change to variables 
4 三 g， 4=C(o)0 
along with the conjugate momenta 
区 O 人 二 下 
Do = Do 十 人 C(50) 六 ) D1， 六 三 Cao) D1. 


With the transformed mass matrix M(D = 瑟 ( 力 AM(,C( 加 六 )B(DT dor the 


matrix 妃 (9) that transforms 万 = 巨 ( 人 p) and the potential 7(9) = 7(0ao,C(0o)0D)， 
the Hamiltonian takes the Simpliftied form (we omit all tldes) 


1 _ 
瓦 = DAM(O)-Ip 十 


mr 
72 ai9l 十 LV7(9). (3.5) 


Eliminating Off-Diagonal Blocks in the Mass Matrix。 We write the mass matrix 


IT(d) as 
oo Mo 
Ad 一 
Cs 1 


With G(ao) = 一 Moo(azo,0)-1Moi(ao,0), we transform 
qd 三 2 二 Goz， 4 三 ， 
with the conjugate momenta 
两 =po+ (元 Go)6) m， 五 = 玉 +G(Gaormo 


This canonical change of variables eliminates Mol and Wio in the transformed mass 
matrix M(ao,0) and keeps the Schur complement on the block diagonal: with the 
Symmetric positive definite matrices 


XIV.3 Mechanical Systems with Solution-Dependent Frequencies 559 


Jo(zo) = Moo(zo,0)， Wi(zo) = (Ma 一 MoM00 Mol)(zo,0)， 


the transformation puts the Hamiltonian into the form (we omit all bars) 


于 1 到 1 
吾 = 了 p0Mo(go) po 二 5PLMa(go) 十 元 1gl 
+ 了 及 (O)P+U(o 二 G(oo)gu91) G.0) 
where 尺 is a smooth matrix-valued function Satisfying 


R(qo;0) = 0， G.7) 


Diagonalizing the Mass Matrix of the Fast Variables， We diagonalize 


Mi(oo)=Q(oo)2(ao) 一 Q@(oo) 


with the diagonal matrix fY2(qo) = diag(wj(ao)) of frequencies and an orthogonal 
matrix @C(dqo),which depends smoothly on do 让 the frequencies are separated. We 
transform 


d0 = 00， d1 一 Q(zo)9 


with the conjugate momenta 

要 日 放 交 SA 信 人 ~ \ 

加 =Do 十 人 Q( 交 )) Di， 严 =Q() D1 

d0 
The matriX 
机 人 
7O = (二 9( 轴 ) Q(O) 
d0 


is of size O(@G) but it is this expression which may become large near avoided 
crosSings of eligenvalues. We conslider the associated matriX 


~、、 (0 Xi 0 一 MT ITY 
X(9) < 人 | 0 本 Y7TMT ITY < (3.8) 


With a matriX 忆 ( 作 Satisfying (3.7), which is a Sum of the appropriately transformed 
previous matrix 尺 and the above matrix X, the Hamiltonian in the new variables 
(也 0) becomes (we omit all hats) 


1 去 1 | 
妞 = 了 20 Mo(4o) -20 十 5D1 (go) pl 十 元 q7 di 


二 DR(Op+U(+GQ@(oo;Q(o)ay) G.J9) 
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Rescaling Positions and Momenta。 We change to rescaled fast variables 
4 王 加 ， 和 一/202(g)72 人 
Cnote that gl = O(s) implies 员 = O(eL2)) with the conjugate momenta 
5 于 1/2 Dry \1/2 
Do 一 po 十 E 人 12(00) 刀 ) DPI， 万 = 02(00) 7D1. 


In the new variables, the Hamiltonian becomes (we omit the haceks on all variables) 


1 _ 1 1 
囊 = 了 DO Mo(go) po 十 元 PL2(oo)z 人 0 2(qo)q 
1 
十 于 有 (OP 十 U(T(do)g)， G.10) 


Where f j 
T EGGQIO 
了 < (人 cl/27 ) 二 ( 2000 】 
and RR(a) is a symmetric matrix of the form 


R9g) = Roo(qo,sM2g1) 。 e172Rol(go,s20l) 
ce-12Pio(qo,sl20) ce-1Rili(qosL201) 


with smooth functions Ri satisfying Rij(qo,0) 三 0. Therefore, the expression 
和 27R(q)p can be rewritten in the form 
工 
DR(OJP = sc(pogo) go 十 DLL(pogo) dl 
十 es 727(pl,Dl di;Do,go) 十 p(D,g)， G.1D) 


with a vector c, amatrix ZL, afunction7 thatis trilinear in pl1,Dl,9ql,andaremainder 
of size p(p;,dq) = O(s2) forpl, gl = O(sl/2), whose partial derivatives with respect 
to Dil, gl are of size O(s3/2), and with respect to po, go of size O(E2). 


了 quations of Motion. The differential equations now take the form 


1 
Do = 三 一 Vo (go(o) :po 十 UV(ao， 0)) 
1 1 
一 Ve (去 好 2(o)m 了 元 do2looo ) 十 万 (P,9) 
0 = Mo(qo) :po+og(p;9g) (3.12) 


人 


with the functions 


1 
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1 
上 】 = -Vs (5 2 R(OP 二 (CT(o)9g) 一 (go， 0)) 


人 】 = RdOP. 


We note the magnitudes 万 = O(sj,go = O(s) and 廊 = O(eL2) ,gl = O(eL2) 
in the case of Separated eligenfrequencies, where the diagonalization 1s smooth with 
bounded derivatives. By (3.11) we have (omitting the arguments D0, qo in c, 卫 ,7) 


万 = -el2e-To+eTVY2anbpipog) 一 <7277VU(do,0) 二 O(c2/ 9) 
9 = 9 G.13) 
where the functions wa and p are bilinear in their first two arguments. 


The System in Adiabatic Variables. We finally leave the canonical framework 
and transform to adiabatic variables as in (2.16). Along a solution (z( 划 ,9( 塘 ) ofthe 
System (3.12) we consider the diagonal phase matrix ( 蕊 defined by 


下 = 4(qo) with 4(qo) = 区 | 


With the constant unitary matrix of (2.14)，which diagonalizes the matrix in 
(3.12), we introduce the adiabatic variables 


刀 一 < 一 1/2 exp( 一 :5 * (3.14) 


and denote the inverse transform as 


DPI 121 一 -12 了 
0 < (上 < Texp( 9) (3.15) 


The differential equations (3.12) for pl, gl then turn into 
1 estp( 一 9 m | = /2 开户 二 se 1/29ig 
1 


with the arguments (po0, sl/2 疡 7, go0,EsLl2@17) in the functions 方 ,91. Inserting the 
expressions for 万 and 9l from (3.13), we obtain as in (2.22) and (2.23), with 


177-77 + 
the differential equation 
六 = exp(-28)JW(om)exp(25)7 G.17) 
1 :fa( 户 四 天 7; Do, do) 
于 总 (=-23) 工 ( 3.18 
NE (Pim7, Gan; po, go) 0 
- 届 (cloooo)+T(orvolo,0)+r G.19) 


with the remainder r(2o, q0, 已 7 Q17) = O(e). 
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Adiabatic Invariants. For a solution with bounded energy, both pl ( 芭 and di (性 雇 
(3.12) are of size O(sl/2) and hence 


We now integrate both Sides of the above differential equation from 0 to 上 The 
integral of the terms in (3.19) is O(sj, as js seen by partial integration Since 大 "( 娘 
is oscillatory with an O(s) integral and po, qo have bounded derivatives. 

We now Suppose that the eigenfrequencies wj(b := wj(qo() remain separated 
and bounded away from 0: there is a constant b > 0 such that for any pair wj ( 坟 and 
wk (加 with 了 夭 thelowerbounds 


lo 的 一 wk 的 [ 宇 6， wj 的 > 2 (3.20) 


hold for all 上 under conslideration. In this Situation, as in Sect.XIV.2.1, the integral 
from 0 to 上 of the term (3.17) is bounded by O(s)j, Since the matrix 卫 has zero 
diagonal. 

It remains to Study the term (3.18) with the bilinear functions w and p. This 
term has only oscillatory components it the following non-resonance condition is 
Satisfied: for all 7,K,! and all combinations of Signs， 














| 的 Lp 人 (如 wl 人 (| >0 (3.21) 


with a positive 0 independent of s. In this case, also the integral over the term (3.18) 
is of size O(s), and we obtain 














1 全 1(0) 十 O(s) for 了 寺 和 Const. (3.22) 
Icondition (3.21) is weakened to requiring that for all 7, KE 三 1. ,770 
wj 十 wk 蚊 土 whasafinite number of at most simple zeros (3.23) 


in the considered time interval, then the estimate deteriorates to (See Exercise ]) 
1 人 四 一 1(0) 十 O(eL2 for tt< Const. (3.24) 


The actions 
万 = GOG=Lm) (3.25) 


are thus adiabatic invariants: 
万 (人 三 万 (0) 十 O(s) for 寺 冬 Const (3.26) 


in case of (3.22), and up to O(sl/2) in case of (3.24). 
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The Slow System. Since the oscillatory energy equals 


1 工 一 
人 12(qo)D1 十 元 引 12(qo)91 三 2 wj(ao)， 


the differential equations (3.12) for the slow variables po, go become, up to O(e)， 


1 是 和 
Do = 一 Vo (5 20 Mo(do) :po 十 D(qo， 0)) 一 >》 万 Voouj(ao) 
和 1 


0 = Mo(ao)- :po0. (3.27) 


Compared with the constrained system with Hamiltonian 52D7M(q)-ID 十 U(d) on 
the configuration manifold ), the Slow motion ls thus driven by the additional poten- 
tial > 万 wji(dqo) depending on the actions 站. See also Rubin 放 Ungar (1957)， 
Takens (1980), and Bornemann (1998) for different derivations and discussions of 
the correction potential. 


Avoided Crossing of Frequencies and Takens Chaos. If the distance 0 of fre- 
quencies in (3.20) becomes so small at a point do( 如 that 62 < s, then there can 
again occur O(1) changes in adiabatic invariants 六 ,as in the Zener example of 
Sect.XIV.1.1. In the present Situation of solution-dependent frequencies, however， 
the level to which 六 Jumps after the avoided crossing, depends very Sensitively on 
the slow solution variables qo(t) through the terms exp( 士 2) ip (3.17). mn tum, the 
Slow motion of po, go after the avoided crossing depends on the new values of 万 
through (3.27). The effect is that the Slow motion depends very Sensitively on per- 
turbations of the initial values in the case of an avoided crossing; See Takens (1980). 
The indeterminacy of the Slow motion in the limit s 一 0 is termed 72Ke1s cpnaosy by 
Bornemann (1998). 





XIV.3.3 Integrators in Adiabatic Variables 


Along-time-step integrator for the oscillatory mechanical System with Hamiltonian 
(3.1) can now be obtained as follows: 


Solve the Slow System (3.27) in tandem with applying an adiabatic integrator 
(See Sect.XIV.1.2) to a Simplified equation for the adiabatic variables， 


2 2 
1 性 这 -9 (29 
7 exp( 三 exDp 三 7 
where Tris given by (3.16) with a simplified matrix 元: with vo0 = Mo(qo)-1po, let 


Flpo mg)= -2(o)12 区 | Go roo7Q(o) 2looj-172 


This matrix 了 captures the Principal terms, coming from the matrix Xol in (3.8)， 
which are responsible for a change of the adiabatic invariants due to an avoided 
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crossing as long as the frequency Separation condition (3.20) holds with a possibly 
E-dependent 5 六 s, e.g., with 6 ~ sL2 where O(1) changes occur in the adiabatic 
invariants. Because of the Takens chaos, it cannot be expected that Such an integrator 
yields a good approximation to“the”solution，but the method can approximate 
an almost-solution (having a Small defect in the differential equations) that passes 
through the avoided crossing zone, and it detects the change of adiabatic invariants. 
The properties of integrators of this type are currently under investigation (Lorenz 
久 Lubich 2000). 

Further we refer to Jahnke (2003, 2004b) for the construction and analysis of 
adiabatic integrators for mixed quantum-classical molecular dynamics, where Simi- 
larly anonlinear coupling of Slow and fast, oscillatory motions OocCUIS，. 


XIV.3.4 Analysis of Multiple Time-Stepping Methods 


The error behaviour of the impulse and molliftied impulse method applied to an os- 
cillatory Hamiltonian System (3.1) with well-separated frequencies can be analysed 
in the adiabatic variables in the Same way as we did in Sections XIV.2.3 and XIV.2.4 
for the case of tme-dependent frequencies. Analogous formulas and the Same con- 
clusions hold; essentially we need to replace the argumentt by ao in the appearing 
functions. However, their behaviour in the Situation of an avoided crossing with 
Takens chaos is presently not understood. 


XIV.4 了 上 Xercises 


1. Show that ， 
上 王 1/(m 十 1) 
exp(- g(s))ds Of(e ) 


让 入 := 由 has finitely many zeros of order at most 7n in the interval [0, 媚 . 
Zi1f Use the 11etjod of slationa1y Phase; See, e.g., Olver (1974) or van der 
Corput (1934). 

2. Show that the adiabatic variables 7( 芭 of (1.4) remain approximately constant 
also in the following cases of non-Separated eigenvalues: 
(a) a multiple eigenvalue 和 ji(t of constant multiplicity mmz for al 上 and the 
orthogonal basis vj 1 人 (UVjm 人 (加 of the corresponding eigenspace chosen 
Such that the derivatives bj (tare orthogonal to the eigenspace for all 已 
(b) acrossing of eigenvalues, Aj() = AR(t) with Ai) 夭 Xe(t),for which 
the eligenvectors are Smooth functions of t in a nelighbourhood of 妃 ; see also 
Born 贸 Fock (1928) for crossings where 和 j 一 AK can have Zeros of higher 
Imultiplicity. 

3. Let the differential equation (1.1) with smooth skew-hermitian 2(t be trans- 
formed locally over [to,to 十 阁 to z(b = exp( 一 上 2sJy(b,so that 


10. 
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二 exp( 开 Zr] (2 的 一] exp(2Z]a 


with 2 = G(to 十 有 /2). Consider the averaged midpoint rule 
1 广 S 这 S 1 
21 一 20 十 exp( 一 =) (2Z(s) -2 expP(=2) ds 了 (20 十 21)，(4.1) 
0 


where Zb is the quadratic interpolation polynomial through 2Z (to), 2., 2 
Show that the local error 2 - z(i) is of size O(14/e2), which is O(P2) only 
for 玫 = O(e). Explain why the error bound cannot be improved to O(j2) for 
尹 王 O(sc) with a < 二 . 

ZI See the proofs of Theorems 2.1() and 3.1 in Hochbruck 儿 Lubich 
(1999b), cf. also Iserles (2004). 


. In the Situation of the previous exercise, let LU be a unitary matrix of elgenvec- 


tors of Z, and jlet 万 人 be the diagonal matrix containing the diagonal entries 
of (2 信 一 2 ) L. Find a modification of the above averaged midpoint rule 


by terms that use only 万 ( 划 , such that the local error is O(j2) for 尹 < <s3/4 证 
the eigenvalues of Q are all separated by a distance 0 independent of <. 


. Compare the error behaviour of the averaged midpoint rules (1.12) and (4.1) 


near the avolded crossing of the eigenvalues in the Zener matrix (1.9). 


. Formulate Symmetric modifications of the adiabatic integrators (1.12) and 


(1.13) that use function evaluations at the grid points 如 and 如 +1 instead of 
如 十 1/2， 


.Consider the differential equation y = 帮 V) 十 9( with a smooth function 


jy) and a function g( 轨 = OU) with g(b = O(0-) with respect to a small 
parameter 0. For the modifed midpoint rule 


十 
呈 +Mf( 色 5 生 )+ 909 几 
to 





show that the local error satisfies 1 一 y( 石 ) = O(121/6). 


.Write the Hamiltonian System (XIJIIUI.9.2) in adiabatic variables and relate this to 


the first terms of the modulated Fourier expanslion. 


. Compare the impulse method of Sect.XIV.2.3 with the method based on the 


Splitting 


瓦 = (iprM(O -ip+ 声 q74(D9j) + (ToD+ 瑟 ) 


Show that Theorem 2.6 remains valid for the choice s 扫 = 0 in (2.37). This 
Corresponds to the projection to the constraint manifold in the mollified impulse 
Imethod as proposed by IZaguirre, Reich 儿 Skeel (1999). 


Chapter 入 YV. 
Dynamics of Multistep Methods 


Multistep methods are the basis of important codes for nonstifft differential equa- 
tions (Adams methods) and for stiff problems (BDF methods). We Study here their 
applicability to long-time integrations of Hamiltonian or Teversible Systems. 

This chapter starts with numerical experiments which illustrate that the long- 
time behaviour of classical multistep methods ls in general disappointing. They ei- 
ther behave as non-Symplectic and non-Symmetric one-step methods，or they ex- 
hibit undesired instabilities (parasitic Solutions). Certain multistep methods for Sec- 
ond order equations or partitioned multistep methods, however have a much better 
long-time behaviour. They are promising methods, because in a constant Step SiZe 
mode they can be easjly Implemented, and high order can be obtained with one 
function evaluation per step. We characterize Such methods by studying their under- 
lying one-step method, their symplecticity, their conservation properties, as well as 
their long-term Stability. 


XV.1 Numerical Methods and Experiments 


We present the numerical methods treated in this chapter, and in numerical experi- 
ments we look at their behaviour on Hamiltonian Systems. 


XYV.1.1 Linear Multistep Methods 


For first order systems of differential equations y = Fo), linear multistep methods 
are defined by the formula 


龙 天 
>》 oj = 天 》 Dion (1.1 
J=0 汪 0 


where aj, 0j are real parameters, ak 夭 0, and |ao| 十 |6o| > 0. For an application 
of this formula we need a starting procedure which, in addition to an initial value 
4(to) = yo,provides approximations 1 .. ,WitoW(to 二 JW(to 十 (KE 一 1)7). 
The approximations yn to Vy(to 十 7 户 ) form 过 有 can then be computed recursively 
from (1.1). In the case 大 = 0 we have an explicit method, otherwise it is implicit 
and the numerical solution yn has to be computed iteratively. 
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Since the fundamental work of Dahlquist 
(1950) it is common to denote the generating 
polynomials of the coefficients by 


天 演 
p(O= > oajG， oo(O= >》 016G5. 
J=0 J=0 


For the classical theory of multistep meth- 
ods we refer the reader to Chap.II of Hairer， 
Ngwrsett 狠 Wanner (1993)，We just recall 
Some important definitions. 


Order. A multistep method has order 7 让 
when applied with exact Starting values to 
the problemy 三 妇 (0 科 dg 气 7),itintegrates 





Germund Dahlquist 


the problem without error. This is equivalent 

to the requirement that 
p(e") 一 pa(e) = O(PTD) for 下 一 0. (1.2) 
Stabijlity，Method (1.1) is stable 让 when applied to y = 0, it yields for all 
0,...,W -1 a bounded numerical solution. This is equivalent to the requirement 


that the polynomial p(C) Satisfies the root condition, i.e., all roots of p(C) = 0 sat- 
jsfy |6| 么 1 and those on the unit circle are Simple roots. The method is called 
Strictiy stapje, 这 all roots are inside the unit circle with the exception of 5 = |. 


Convergence. If a multistep method is stable and of order 7 > 1l, it is convergent 
of order r for all sufficiently smooth problems. This means that, asSuming starting 
approximations with an error bounded by O(j), the global error satisfies yn 一 
zy(t 如 十 np =O(j7) on compact intervals 7 六 苹 工 . 


Symmetry， If the coefficients of a multistep formula (1.1) Satisfy 
QK 一 7 二 一 Q 放 OK 一 0 for all 了 7， (1.3) 


then the method is called Symmetric. Condition (1.3) implies that for every Zero (5 
of p(G) also its inverse C-1is a zero. Hence, for stable symmetric methods all zeros 
of p(G) are simple and lie on the unit circle. 


了 xample 1.1， We consider the pendulum equation (I.1.13), and we apply the fol- 
lowing multistep methods: the 2-step explicit Adams method 


3 工 
oo 一 二 At 一)， (1.4) 


the 2-step backward differentiation formula (BDP) 








3 工 
了 Vn 十 2 2Vn+1 yn 一 尺 j 上 2， (1.3) 
and the (2-step) symmetric explicit midpoint rule 


1 Germund Dahlquist born: 16 January 1925 in Uppsala (Sweden), died: 8 February 2005. 
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explicit midpoint 


Adams expl 厂 一 2 


Fig. 1.1. Solutions of the pendulum problem (I.1.13); explicit Adams with step Size 几 三 
0.5, initial value (po,qo) = (0,0.7); BDEF with step size 玉 = 0.5, initial value (po, qo) 三 
(0,0.95); explicit midpoint rule with 中 = 0.4 and initial value (zo,qo) = (1.1,0) 


or 一 加 十 27 户 11 (1.6) 


For all methods we take 册 = yo 十 几 jo as the approximation for V(to 十 由. The 
results of the first 108 steps are Shown in Fig.1.1. We observe that the first two 
methods, as expected, behave Similarly as the explicit and implicit Euler method 
(the numerical solution Spirals either outwards or inwards). This will be rigorously 
explained in Sect.XYV.2.1 below. However as might not be expected, the Symmetric 
method (1.6) does not behave like the implicit midpoint rule (cf. Fig.I.1.4), it Shows 
undesired increasing oscillations (Parasitic Solutions). 

After this negative experience with classical multistep methods,，the obvious 
question ls: are there multistep methods which have a long-time behaviour that ls 
comparable to Symplectic and/or Symmetric one-step methods? 


XV.1.2 Multistep Methods for Second Order Equations 
Many Important Hamiltonian Systems are Second order differential equations 


乡 = 帮 y)， (1.7) 


where the force jis independent of the velocity 7. Introducing the new variable 
一 四 we obtain the system = 网 2 = jW) of first order equations. If we apply 


a mnultistep method (1.1) with generating polynomials Am(6) 王 oa76G7 and 
ax(() 一 > DG7 to this system, we get 


开 * 下 二 天 开 * 
0 
洒 二 人 7 了 二 全 芝 汪 和 


An elimination of the v-variaples then yields 
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开 天 
> .ojgnH = 有 尼 > DJ， (1.8) 
57=0 7 二 0 


where 大 一 21*， 0(0) = 0*(0)” and ao((C) = ax(GO)2?. We consider here methods 
(1.8) which do not necessarily originate from a multistep method for first order 
equations, and we denote the generating polynomials of the coefficients ay and [5 
again by p(C) and (6). From the classical theory (see Sect. II.10 of Hairer Ngrsett 
贸 Wanner 1993) we recall the following definitions and results. 


Order A method (1.8) has order r 夺 its generating polynomials Satisfy 
pfle) 一 jp2cole 人 =O2) for 玫 一 0. (1.9) 


Stabijlity. Method (1.8) is stable 这 all zeros of the polynomial p(C) satisfy |6| < |， 
and those on the unit circle are at most double zeros. Observe that for methods 
originating from (1.1) all zeros are double. The method is called strictiy staple, 于 
all zeros are inside the unit circle with the exception of 5 = 工 . 


Convyergence. Ifamnultistep method (1.8) is stable, of order7 之 1 and ifthe starting 
values are accurate enough, the global error satisfties yn 一 V(to 十 nj) = O(1) on 
compact intervals 7 刀 区 荆 . 


Symmetry， If the coefficients of (1.8) Satisfy 
CQK 一 7 二 Q7 [7 一 [0 for all 7， (1.10) 


then the method is symmetric. Again, for every zero 6C of p(C) the value C 一 Lis also 
a Zero. Hence, stable symmetric methods have all zeros of p(GC) on the unit circle 
and they are at most of multiplicity two. 


Dahlquist (1956) noticed that double zeros of p(6) on the unit circle can lead to 
an exponential error growth. Lambert & Watson (1976) analyzed in detail the appli- 
cation of (1.8) to the linear test equation 艺 = 一 w2%. They found that with symmet- 
iic methods for which p(6G) does not have double roots on the unit circle other than 
4 三 1,thenumerical solution remains close to aperiodic orbit (for sufficiently small 
Step sizes). For example, the Stormer-Verlet method yn 1 一 2 十 1 三 712 广 
Satisfies this property for 0 < Ahw < 2 (See Sect.I.S.2). The study of the long-time 
behaviour of symmetric methods (1.8) was then put forward by the article of Quin- 
lan & Tremaine (1990), where an excellent performance for Simulations of the outer 
Solar System is reported. 


了 xample 1.2， We consider the Kepler problem (L.2.2) with initial values (1L.2.11) 
and eccentricity e 一 0.2. We apply the following three methods with constant step 
size 几 一 0.01 on the interval of length 27r . 105 (ie., 105 periods): 


7 5 工 1 
人 和 HH 一 2 二 有 (了 ta 一 五 记 2+3Pt 一 五 广 ) 





4 4 4 
(B) or 一 2 十 甸 王 凶 (Sas 十 本 加 +2 十 1) 











7 工 7 
(C) mm 二 4 一 2%n+3 机 22m 二 2 人 22mn+1l 守 2m 一 有 P( 六 as > 本 加 +2 守 也 ) 。 
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error in total energy 


人 ) 
metnod 《 
near Bow 














10-8 method (C) 
10720 | | | 1 1 1 | | | LT | | | | | | | | | | | 万 
101 10” 103 104 105 


Fig. 1.2. Error in the total energy for the three linear multistep methods of Example 1.2 
applied to the Kepler problem with e 一 0.2 


Allthree methods are of order7r = 4; method (A) is strictly stable, whereas methods 
(B) and (C) are Symmetric. For method (B) the p-polynomial has a double root at 
4 三 一 1, for method (C) it does not have double roots other than 1. Starting values 
1,y2, and ya are computed very accurately with aphigh-order Runge-Kutta method. 

The error in the total energy is plotted for all three methods in Fig. 1.2. On 
the first 10 periods, all methods behave Similarly and no error growth is observed. 
Beyond this interval, method (A) shows a linear error growth (as it ls the case for 
non-Symplectic and non-Symmetric one-step methods), method (B) has an exponen- 
tial error growth, and for method (C) the error remains bounded of size O(j) on 
the whole interval of integration. One of the aims of this chapter is to explain the 
excellent long-time behaviour of method (C). 


Stabilized Version of (1.8). Due to the double zeros (of modulus one) of the char- 
acteristic polynomial of the difference equation > 7 a7gyn+7 二 0,， we have an un- 
desired propagation of rounding errors (especially for long-time integrations). To 
overcome this difficulty, we split the characteristic polynomial p(C) into 


P(9) = O4(G) DB(9)， (4.11) 
Such that each polynomial 
开 A 开局 
paO=》 al6， paB(O=》 af 6 
让 3 


has only Simple roots of modulus one. Introducing the new variable PuUn :一 


> 7 af on+ 放 the recurrence relation (1.8) becomes equivalent to 


太 4 大生 天 

4 已 
有 
J=0 J=0 J=0 


This formula，which for the Stormer-Verlet Scheme corresponds to the one-step 
formulation ([L.1.17), is much better Sujited for an Implementation. If the splitting 
is Such that 0 (1) = 1, the discretization (1.12) is consistent with the first order 
partitioned system 7 =vD = Fy). 
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XV.1.3 Partitioned Multistep Methods 


Motivated by the stabjlized version (1.12) of multistep methods for second order 
equations, let us conslder general partitioned Systems of differential equations 


= 7 v)， 卫 三 9(U)， (1.13) 


Where, needless to Say, y and v may be vectors. The idea is to apply different multi- 
Step methods to different components. We thus get 


天 天 

人 4 人 4 
> ai 7 一》 有 大 珊 交 人 Co 人 
J=0 J=0 5 一 0 


where 万 = jnvn) and gm = 9(Vn un). We can take the same for both meth- 
ods without loss of generality, if we abandon the assumption |ao| 十 |Bo| > 0. 

Such a method is of order y, 让 both methods are of order 7. It is Stable (strictly 
stable, Symmetric, . . .), 寺 both methods are Stable (strictly Stable, Symmetric, . . .). 


了 xample 1.3， For our next experiment we use the Symmetric methods 


(A) ， n+3 一 yn+2 十 yn+l 一 2n 三 丈 ( 万 +2 赴 万 +1) 
(B) : Un 二 3 一 Vn 二 1 一 2 刀 0n 二 2. 


Both methods are of order 2,， and their p-polynomials 
pa(O = 人 《-D(2+Dandon(O = (CDK 二 + 《sy》 


do not have common zeros with the exception of 5 = 1 


(1.15) 











0<t<10 
750 < 了 寺 < 760 
00 < 了 上 < 7510 










method (AB) 











0<t 上 <10 
750 < 了 寺 < 760 





method (A) 























Fig. 1.3. Three versions of the methods (1.15) applied with step size 尺 = 50 (days) to the 
outer Solar System. For method (B) only the numerical orbits of Jupiter and Saturn are plotted. 
The time intervals are given in units of 10 000 days 
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We choose the outer Solar System with the data as described in Sect.I.2.4, and 
we apply the methods in three versions: (人 as partitioned method (AB), where the 
positions are treated by method (A) and the velocities by method (B); () method 
(A) is applied to all components; (ii) method (B) is applied to all components. 
The numerical results are Shown in Fig.1.3. Whereas the individual methods Show 
instabijlities on Tather Short time intervals, the partitioned method gives a correct 
Picture even with a large step Size 玫 = 50. 


XV.2 The Underlying One-Step Method 


Much insight into the long-time behaviour of multistep methods can be gained by 
relating their numerical solution to one-step methods. This then allows for an appli- 
cation of the consliderations of the preceding Sections. 


XV.2.1 Strictly Stable Multistep methods 


It was a Surprising result when Kirchgraber (1986) proved that strictly stable multi- 
Step methods are essentially equivalent to one-step methods. Although this one-step 
method 18S“quite exotic”(Eirola 儿 Nevanlinna 1988), it ls the key for a better un- 
derstanding of the dynamics of strictly stable methods. 


Theorem 2.1 ( 玫 irchgraber 1986). Comsider Ga stricty Staple Lear 711istep 
1Iet1jpod (1.7) GDPPLied wii G SUNciemty SQL Step Size 及 T1e1 1 加 eye exis1s da oO1e- 
SteD Jie1jod GPSUCH 1Pat Jor Startz8 apProxi1aatio11g coO1IPUied py Vi 三 下 (yo)， 
了 7 三 1 ...; 大 一 1 1pe 11terical solufion or identical io 加 at opiained Dy 加 e 
oO11e-sifep 11e 妨 od ie yn+l 三 环 ( 0) Jora17 > 过 0. 


Proof The idea is to reformulate the multistep method (1.1) in such a way that the 
Invariant Manifold Theorem of Sect.XII.3 can be applied. TIo keep the notation as 
Simple as possible, let us consider the case 大 三 3. 

We write the method in the form 





2Zm 十 3 一 Q2 一 41 一 40 Zn 十 2 (yn Zn 十 1， Vn+2) 
oa =|1 0 0 | 5 0 C.D 
Vn+1l 0 1 0 om 0 





with w 三 ai/ak, and we transform the appearing matrix 4 to Jordan canonical 
form .J 一 人 -147. We thus get 





| 0 0 2m 十 2 
ZI=|0 da do 2+hG 7 思 =TT|o| CC2) 
0 ao2l ao22 Zn 


Since the method is strictly stable, 1 is a Simple eigenvalue of 4, and all other 
eigenvalues are less than 1 in modulus. Consequently, the matrix 刀 = (qij) Satisfies 
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DPI < 1Lin asuitable norm. Partitioning 2 = (6 into its frst component 
Cn and the rest (collected in 11)，we see that (2.2) is of the form (XIL3.1) with 
ZLzziZLazyZoz ofsize O(P),and Lo =||D|| < 1.Theorem XIL3.1 thus yields the 
existence of a function 7 = s(6) such that the manifolds 


0 :ERe} and 人 :ERe} 


are invariant under the mappings (2.2) and (2.1), respectively. The function s() is 
Lipschitz continuous with constant 入 = O( 刀 ). 

Since the first column of 7 ，which is the eigenvector corresponding to the 
eigenvalue 1 of 4, is given by (1,1,1)7, the last component of 人 (。) Satisfies 
/一 上 +9(6) where q(o is Lipschitz continuous with constant O(P). By the Banach 
fxed-point theorem this equation has a unique solution < = 7(Z 妇 .Consequently, the 
manifold 人 4 can be parametrized in terms of y as 


人 人 = 人 ( 玫 ( 轨 于 ( 办 妇 7E 开 “ 


Its invariance under (2.1) implies that 


(人 一 02 一 01 一 00 感 (y) (0 2 (0)) 
Gu ( 信 全 1 0 0 Go 十 几 0 
站 0 1 0 小 0 


and consequently 分 = 于 (y) and 环 ( 信 = 萝 ( 四 ,so that 萝 () = 于 (四 ). This 
holds for all y, and thus proves the statement of the theorem. 














了 xample 2.2， For a scalar linear problem 7 = Ay, the application of a multistep 
method yields a difference equation with characteristic polynomial p(C) 一 几 Xa(C). 
Denoting its zeros by G1(PA)).…，,Gk(PRA)，where G1(0) = 1 and 60)| < 1 tor 
了 过 2, the numerical solution can be written as (assuming distinct Cj(AA)) 


Un = c1GT (AAA) 十 czG2(PA) 十 .十 ckGR (PP 和 A). 


The coefficients cl,...,ck depend on AAA and are determined by the starting ap- 
proximations yo,...,W 1. In this Situation the underlying one-step method is the 
mapping yo mm G1(PA)V0. Observe that G1(z) is in general not a rational function as 
weare used to with Runge-Kutta methods. 


Remark 2.3 (Asymptotic Phase). For arbitrary yo, 1,...,VK_1 close to the ex- 
act Solution, there exists 必 Such that the multistep solution {y} and the one-step 
Solution {? 罗 }， given by 541 三 歼 ()， approach exponentially fast, i.e.， 


| 一 天 | < Const forallm >0 (2.3) 


with some p satisfying 0 < D < 1 (See Exercise XIIL.3). This is due to the attractivity 
of the invariant manifold 人 7). A proof is given in Stoffer (1993), and it is based on 
techniques of Nipp 多 Stoffer (1992). This result explains why strictly stable linear 
multistep methods have the same long-time behaviour as one-step methods. 
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Jn the context of “geometric numerical integration”we are mainly interested im 
Symplectic and/or symmetric methods which, for linear problems, are characterized 
by the condition G1( 一 2)G1(z) 三 1 (see Sect.VI.4.2). This, however, is only possible 
for symmetric multistep methods (Exerclse 1) which cannot be strictly Stable. 


XV.2.2 Formal Analysis for Yeakly Stable Methods 


The proof and the statement of Theorem 2.1 break down as Soon as at least one root 
of p(C), different from 1, has modulus one. Moreover Example 2.2 shows that we 
cannot expect a property like (2.3) with P < 1. All we can hope for is to find an 
underlying one-step method as a formal series in 凡 .Surprisingly, this provides a lot 
of inslght into the long-time dynamics of weakly stable multistep methods. 


Theorem 2.4，Co7zsider a iear 712Ultistep 1ietpod (1.1 da1d assU1ie 1 6 一 工 全 
asil8le Joof of 0(0) = 0. Te tere exis1s Ga Uligue /jaal expa11sio7z 


By) = 十 Pda( 仿 十 jd2(g) 十 .… (2.4) 
SCP 1Pat 


太 天 
>》 oj 到 ( = 六 》 (到 ( 们 )， 
J=0 J=0 


Wjere iden1ity 1 adejstood 胡 1 加 e ye11ge ofjoraal Power se1iey 六 几 
Tripe 1IUl1istep 11etpod 18 of order 六 1e1l also 1je zideriyi8g o1le-step 11etjpod 
ioforderm ie Gy) 一 op = O(rT+D)， 


The formal series for 环 ,(V) is called“step-transition operator” in the Chinese 
literature (See e.g., Feng (1995), page 274). We call it “underlying one-step method . 
Notice that this theorem does not require any Stability assSumption. 


Proof Expanding 四 (2) and 大 ( 环 ( 切 ) into powers of 几 , a comparison of the co- 
effhcients yields 





Pd = DJO) 
ZeO) = 一 全 册 省 2 仿 +wODPOOo 3) 
AU)d) 三 …， 


Where the three dots represent known functions depending on derivatives of 太 Z) 
and on di(V) with < 7. Since 0(1) 夭 0 by assumption, unique coefficient func- 
tions dj(V) are obtained recursively. The statement on the order follows from the 
fact that the exact fow 2n(y) has a defect O(Pr+l) in the multistep formula. 














The computation of the previous proof shows that the Series (2.4) 1 a B-SseriesS. 
This follows rigorously from the results of Sect. II.1.4. Whereas the B-Series Tep- 
resentation of Runge-Kutta methods converges for sufficiently small P, this is in 
general not the case for (2.4); See the next example. 
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了 xample 2.S， Consider a consistent two-step method 


Q2Vn+2 十 Q1Vn+l 十 QoVn 二 ](B2 记 + 十 Dj 思 +1 十 D0 思 )， 


and apply it to the simple system 7 = 四 ,二 = 1. The y-component of the under- 
lying one-step method then takes the form 


Er 人 (too) 三 加 十 》 屯 oj710-D(to)， (2.60) 


JI 之 1 
Putting jb = et yields 


poe% 二 Bies 十 B0 
oa2(1 十 es) 十 al 





4(O9= > ac 一 = 


IJ 之 1 


forthe generating function of the coefficients aj. Since this function has finite poles， 
the radius of convergence of 4((C) is finite. Therefore, the radius of convergence 
of the series (2.6) has to be zero as Soon as JJ)(t0) behaves like 刀 1 A7 (this is 
typically the case for analytic functions). Independent of the fact whether the method 
is Strictly Stable or not, the series (2.6) usually does not converge. 


Both，Theorem 2.1 and Theorem 2.4,， extend in a straightforward manner to 
partitioned multistep methods (1.14). Io get analogous results for multistep methods 
(1.8) for second order differential equations, one has to introduce an approximation 
for the velocityv 三 %. This will be explained in more detail in Sect.XV.3 below、. 


XV.3 Backward Error Analysis 


The backward error analysis for multistep methods (Hairer 1999) ls presented in 
two StepS: 


e for "smooth”numerical solutions (obtained by the underlying one-step method); 
e for the general case. 


The idealized Situation of no parasitic terms gives already much insight into conser- 
vation properties of the method (See Sect.XV.4). The Study of the general case 1S， 
however neceSSary for getting estimates for the parasitic Solutions (Sect.XV.3), SO 
that rigorous statements on the long-ttme behaviour are posSible. 


XV.3.1 Modified Equation for Smooth Numerical Solutions 


The formal backward error analysis of Chap.IX could be directly applied to the un- 
derlying one-step method of Sect. XV.2.2. However due to the non-convergence of 
the series for @G1j(y), diffculties may arise as Soon as Tigorous estimates are desired. 
We prefer to derive the modified differential equation directly from the multistep 
formula and thus avoid the use of the underlying one-step method. 
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Theorem 3.1， Comsider alizear7altistep 1aetpod (1.1 and assume 1pat p(1) = 一 0 
a1d 0 (1) 三 a() 天 0.71en tpere exist Unigue 1-inadependent Jinctions 方 (7) SC 
11ak Jor eyery 1runacatiomn index Nevery solution of 


六 = jg 十 六 户 轨 十 性 户 轨 二 十 迪 开 ArO) (3.1) 
SG1is1jres 
天 开 
>》 oajgylt 二 入) = 天 》 Bijt+7) + OU G.2) 
IJ=0 7 一 0 


太 1Pe altistep 11etpod 1 oforder 记 加 en 方 (四 ) 三 0Jjor2 芝 7 挟 7 太 ipe maetpod 
Sy11I111et1ic, 加 e1a 方 (V) 三 0Joralleven 7 sotpafrtpe7modified equation (3.7) pas a1 
EXDG11SiO7 17 eye1 PoWe1Sy or 7 


Pioof Using the Lie derivative (Dig)(y) = 9 (上 户 (y) (with 广 () = FJ)) and 
万 = Di 二 7PDs 二 71Ds 十 .the solution of (3.1) with initial value y 人 = Y 
satisfies y(t 十 77j) = ez 十 ONT+T1) and FT7D) = ez2DPF 人 十 ONT+DH) 
(by Taylor expanslion). We thus have 


pe = hole )j TO G.3) 
With the expansion zolfez)/p(ez) = 二 Hz 十 Haz2 十 ... this becomes 
六 = (上 MPD+H12D2 二 .)FJ 上 +O(O). (3.4) 
A comparison with (3.1) yields 方 (y) = Fo),and 


方 人 = > >》 (5 也) (W) (3.5) 


1 之 1 刀 十 .… 十 思 一 7 一 工 














for7 > 2, which uniquely defines the functions 方 (y) in arecursive manner. 


Lemma 3.2. 矿 FJ) 5 aaatytic and poxded py MT 不 BR(o), tpen we jave 


MA 人 NI 
| 疡 人 <wM( 2 jr lol<R2 6G9 


WhereG1Q71 depe1id o1zly on 加 e coejicients ai Dj of tpe Ustep 1aethod. 


Prooj The estimate (3.6) ls obtained as in the proof of Theorem JIX.7.3. We just 
Sketch the main idea in the notation used there. With 6 = 民 /(2(J 一 了)) we have 
|| 方 上 和 6027， where the generating function pg(G) = >)>1 bjG2 of the 0 satisfies 


O= 公 (+olxO， 


1>1 


By the implicit function theorem, b(G) is analytic and bounded in a disc of radius 
cO/M centred at the origin (c is a positive constant depending only on the coef- 
ficients of the multistep method). The estimate (3.0) then follows from Cauchy's 
inequalities as in the proof of Theorem IX.7.3. 
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It is remarkable that,，although the Taylor series of the underlying one-step 
method generally diverges, the _ coefficient functions of the modifted differential 
equation Satisfy the Same estimate as for Runge-Kutta methods. This enables us 
to prove an analogue of Theorem IX.7.6 which, for one-step methods, is the main 
ingredient for exponentially small error estimates. One can prove that for Suitably 
chosen V = N(P) and for 灵 和 Po/4 with Po = 尺 /(e7AM), the solution of (3.1) 
Satisfies 


开 开 
| 袜 se+ 疝 -po7G6+ 闪 中 <iaMeroA 
JI 一 0 放 0 


where 7 depends only on the multistep formula. The proof of this statement ls Sim- 
ilar to that of Theorem IX.7.6. We Skip details and refer to Hairer (1999). 

For strictly stable multistep methods, Theorem 2.1 together with the Invariant 
Manifold Theorem XI.3.2 thus imply that the underlying one-step method is expo- 
nentially close to the exact solution ofthe truncated modified equation. The parasitic 
Solution termas are raplidly damped out by the property (2.3) of asymptotic phase. The 
Same conclusions as for one-step methods can therefore be drawn. 

For Symmetric methods the Situation ls not So Simple. One has to Study the par- 
asltic Solution components to get information on the long-time behaviour of the 
numerical solution of (1.1). The basic techniques will be prepared in Sect.XYV.3.2. 


Partitioned Mujltistep Methods. The extension of the modified differential equa- 
tion to methods (1.14) is Straightforward. There exist functions 方 (y,v) and gj(v) 
Such that the exact Solution of 

V 一 JU) 十 疡 户 (y) 填写 十 jw(yo) 


站 (3.7) 
眉 一 go) 十 jga(0o) 十 .十 PT1gw() 


satisfies the multistep formula (1.14) up to a defect of size O(PN+1) .The coefficient 
functions can be computed by comparing (3.7) to 


六 = (二 MOPDPTAD2T JI+OON) 二 
忆 


十 127D 上 Hi2D2+. JJg( TOON)， 














where the real numbers 1 43) are given by zat4)(ez)/p(4)(ez) = 工 十 


水 
Mgz 上 Hz2 十 .andby za(B)(ez)/p(B)(ez) 一 1 十 赂 2)z 十 HB)z2 十 


Tespectively. The Lie operator is defined by 万 = Di1 十 几 Ds。 十 1.Da 十 .…., where 
( 忆 并 )(u) = 本 (人 ) 方 (uv) 十 到 (yuv)9j(v), and it corresponds to the time 
derivative of solutions of (3.7). 


and 内 


Multistep Methods for Second Order Differential Equations，The method (1.8) 
for differential equations y = jy) can be treated in a similar way. In the absence 
of derivative approximations we get a modified differential equation of the Second 
order 
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六 = 十 户 (信念 二 十 辽 J( 人 7 (3.9) 


where the perturbation terms also depend on V. Its exact solution Satisfies the multi- 
step relation (1.8) up to a defect of Size O(A 二 让 (3.9) is equivalent to 


疙 一 (LTHMAhD+Hpji2D2 十 ..)FJ) 二 OO )， (3.10) 


where zZz2o(ez)/plez) = 1+HzZ 二 Hz2 十 ..，and the time derivative is 
given by the Lie operator 万 = Di + AD + 12Da 十 .with (Diz)( 信 = 
瑟 ( 凡 9 细 2 十 琴 ( 轨 切 帮 gg) and (证 )( 切 三 古人 人 切 方 加 人 for7 过 2.Acom- 
parison of equal powers of 灵 in (3.9) and (3.10) uniquely defines the coefficient 
functions 方 (2). 

Ithe multistep method (1.8) is complemented with a difference formula for 
approximations of the derivative 风 三 V at grid points， 


1 
| 
一 261gn4， (G3.11) 
5 一 一 ! 


we get an additional modified differential equation 
4 一 (1 十 7D 十 2 了 2D2 十 ..)7. (3.12) 


The coefficients zj are given by ZIb(ez) = 工 十 7Z 十 zz22 十 ..，wWhere 
0(0) = 温 可 1 61767. For given y, this relation gives a formal one-to-one correspon- 
dence between v and V. Consequently, the differential equation (3.10) combined 
with (3.12) can be considered as a first order differential System for the variables V 


and 1. 


XV.3.2 Parasitic Modified 上 Equations 


In practice, due to the necessity of starting approximations V1,...,VK_l,the numer- 
ical solution of a multistep method does not lie on a solution of (3.1). For methods， 
where initial perturbations are not damped out Sufficiently fast (cf. property (2.3) 
of asymptotic phase), an additional investigation ls therefore needed for the study 
of the propagation of perturbations in the starting approximations. Let us Start with 
two illustrating numerical experiments. 


了 xample 3.3， Consider the explicit, linear 3-step method 


yn+3 一 yn+2 十 yn+l 一 2r 三 及 万 +2 让 万 +1)， (3.13) 


with characteristic polynomial p(C) = (6 一 1)(C2 十 1),and apply itto the pendulum 
equation ([.1.13). For a better illustration of the propagation of errors we consider 
starting approximations V1,y2 that are Tather far from the exact Solution passing 
through yo. The result is Shown in Fig. 3.1. We observe that the numerical solution 
does not lie on one Smooth curve, but on four curves, and every fourth solution 
approximation 1s on the Same curve. 
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Fig. 3.1. Numerical solution of (3.13) applied to the pendulum equation. The initial approx- 
imations yo 王 (1.9,0.4), yi 三 (1.7;,0.2), ya = (2.1,0) are indicated by black bullets; the 
Solution points ys, V7, y11;,.. .in grey 























Fig. 3.2. Numerical solution of the explicit midpoint rule (3.14) applied to the pendulum 
edquation. The initial approximations yo 三 (1.9, 0.4), Wi 三 (1.7, 0.2) are indicated by black 
bullets; the solution points y2, V4, y6,.. .in grey 


This example Shows an Unexpected good long-time behaviour. Although the 
starting approximations are far from a Smooth solution, the distance of the numeri- 
cal approximations to a Smooth solution curve does not increase. This is, however， 
not the typical Situation as can be Seen from our next experiment. 


了 xample 3.4，We consider the explicit midpoint rule 


yn+2 一 yn 一 2 几 让 (3.14) 


which has p(C) = (5 一 J)(C 十 1) as characteristic polynomial. This tme, the nu- 
merical Solution (See Fig.3.2) lies on two Smooth curves. In contrast to the previous 
example, an unacceptable linear growth of the perturbations can be observed. 


To be able to explain this behaviour of the multistep solutions, we complement 
the analysis of the modifed equation for Smooth numerical solutions with So-called 
parasitic modified edquations. This theory has been developed by Hairer (1999) for 
first order differential equations, and extended to Second order Systems by Hairer 公 
Lubich (2004). 
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Consider a stable, symmetric multistep method (1.1) and denote the zeros of its 
characteristic polynomial po(C) by 61 = 1 (principal roob and CG2，...,GR (parasitic 
Ioots). We then enumerate the set of all fmnite products， 


{ 人 元 三 从 1 339) 


It is {2 一 人 一 1} for method (3.13) and {1, 一 1} for the explicit midpoint rule 
(3.14). The set of subscripts Z can be finite or infinite. We let 7 = 了 工 \ {1} and 
we denote byZN andZN the finite Subsets of elements which, in the representation 
(3.15), have 2 21 < N. 

Motivated by the previous examples and by representations of the asymptotic 
expansion of the global error of weakly stable multistep methods (see for example 
Sect. JU.9 of Hairer Nwrsett & Wanner, 1993), we aim at writing the general solution 
yn of the multistep method (1.1) in the form 


名 =gnj 十 >》 人 ze(n 站 ， G.10) 


LETZ* 


where y(b) and zz (are smooth functions (with derivatives bounded independently 
of 几 ). The following result extends Theorem 3.1. 


Theorem 3.S， Co7zsider ad staBple, co118iste115 G12d Sy111111et1ric 1IUL1iStep 11etpod (7.1) 
For eyery 1ru1cation ider N 之 2 1jere 加 e1l exisf 几 -ipndepe1de1t jzactio118 
户 j 0) mV 大 辐 =(z) 人 sucj tpatjporeverysotution oF 


及 = 户 汪 ( 凡 辐 ) 十 及 记念 天 ) 二 十 1 放 ( 沪 于) 

= 大 ia 十 六 Pa) 十 .十 TIRN) Jor2<L< 天 

2 一 玫 庆 2( 力 2) 十 .十 大 vi Jor > 大 (3.17) 
2 =0 Jor 上 g&T 





Wi iitialyvalues zz(0) =O() Jor2 生 L4 挟 帮 1ejinction 


z 介 一 g 昌 十 》 6 ze (3.18) 
LEZZ* 
SG1iS1jres 
关 因 
>》 "azz(t+7J = 万》 DJ1(zG+7) + OORT+D)， (3.19) 
7=0 IJ=0 


For 2 二 01jedjerential eqguation Jory ms tpe samie ay 11at of TH1eore1l 3 了 7. 71e 
SOLUtio1ns of (3.17J satis 放 ap 人 区 = 页 均 wjpenever Ge = G and Wis yelation jpolcts 
Jpr tpe aitial yalues. Moreover zz 四 = Opm+1) om poxnaded 1pnre intervals 太 Ce 
zaproduct ofno jemwertpanmm 之 27oof ofp(0). 


Pioof Welet21 信 :=V 罗 andinsert the finite sum (3.18) into (3.19). This yields 
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天 六 
的 间 丰 9 交 0 


7=0 7 一 0 人 
呈 亲 cr 和 aiGenpatg = 》 Ginp(GeenP)ze 介 ， 
ZE 工 JI 一 0 ZE 工 


where, as Usual,，D represents differentiation with respect to time. We then expand 
jz() into a Taylor series around y(t)， 


jp = 科 二 jg( 和 信和 的 并 cs 的 ) 


20 之 0 全 ES LE 了 Z* 
有 1 7 
0 
LE 了 7 过 0 Ge1 …Gem 二 6 


This gives, as above， 


号 Dij(ztt 二 7)) (G3.20) 
> >》， Co 人 CeenD) >》， >》， jm) (人 O)) (2 人， 2 信 ). 
ZE 工 mm 二 0 ”Co 6em 一 Ce 


Comparing coefficients of 6v 办 


peerp)jz= ho(Goenp) 和 > fm(OJ(z zt) G2D 


mmZ0 ”Ce Gem 三 Ce 


forleEZNin(3.19) thus yields 


(forZ =1and7n =0thesumis understood to include the term jy)). With the 
expansion Za(Ceez)/p(Ceez) = MoTTAIZTHMoz2 二 .forl 和 < Fwhere 
Ce is asimple root of p(G), this equation becomes 


入 一 (ucoTTHanD 二 . 由 冯 二 号 7m(g)(z zt)，G22) 


7m00 人 0 


and with a(Geez)/p(Geez) = No+HeIZ 二 Heoz2 二 .forl > jwherep(Go) 天 0 


2 二 pa(ueo+HanD+…) >》 加 >》 JoD()(zo ztm)， (03.23) 


mmZ>0 ”Ge Cem 一 Ge 


In the usual way (elimination of the first and higher derivatives by the differential 
equations and by the differentiated third relation of (3.17)) this allows us to define 
Tecursively the functions 户 j(y,Z”). 

From this construction process it follows that on bounded time intervals we have 
2 人 划 三 Oforall > 2,andz 人 = On+i) ifGeis aproduct of no fewer 
than 7m0 之 2 roots of 0(G). P (3.20) and in the above construction of the coefficient 
functions 盛 j(V2) wehave neglected terms that contain at least N factors zj). This 
gives rise to the O(PANT+1) term in (3.19). 
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Initial values V(0), ze(0), 上 = 2,.….,K, for the system (3.17) are obtained from 
the starting approximations yo0,...,W_1 Via the relation 


历 一 WO 阅 二 >》 Gaze0， 了 一 0 天 一 1 G.24) 
《Er* 


For 灭 = 0 this represents a linear Vandermonde System for yV(0),， zz(0). The Im- 
plicit Function Theorem thus proves the local existence of a Solution of (3.24) for 
Sufficiently small step sizes 妃 . 这 yj 7 三 2,.. .5 approximate a solution yez(t of 
1 三 jy) with an error O(j) (with s 冬 了 十 1, wherer is the order ofthe method)， 
then Vy(0) 一 yez(0) = O(1) and zz(0) =@O(1) for2 = 2 大. 

The representation (3.16) of the numerical solution and the (Principal and para- 
Sitic) modified equations (3.17) will be the main ingredients for the study of long- 
term stability of multistep methods in Sect.XV.3. An extension of the previous the- 
orem to partitioned multistep methods is more or less Straightforward. We leave the 
details as an exercise for the reader. 


Multistep Methods for Second Order Differential Equations. A completely 
analogous result can be proved for stable, Symmetric multistep methods (1.8) ap- 
pliedtoY = jy). We again denote the zeros of p(C) byGi = 1andGoL = 2 .9. 
Notice, however that 61 = 1 is always a double zero, and the others can be Simple 
or double zeros of p(G), and that g 和 K. We consider the index sets 了 , 7 , ZN, and 
ZN as in (3.15). 


Theorem 3.6， Co7zsider da siaple, co118iste11 G120 Sy111111et1ric 1IULGiStep 1Ietpod (7.8). 
For eyery 1runcation idexr N 之 2 1jere 加 e1 exist 有 -indepe1de1t jzactio118 
记念 太 玫 ) (wpere Zr demotes ipe veclor collecting ds ele1nen1 2 各 这 Ge 10 
douple oobp and ze ifGeiasiaplie rooforp(C)) sucz tpPatjorevery sotution or 





7 一 万 (0 2 ) 十 用 广 2(2 轨 2) 十 .十 7 开户 ww( 凡 成 于 ) (3.25) 
2 二 1 思念 于 ) 十 .十 AT 人 2) 矿 p(Go) 一 0 (Co) 三 0 
和 = ja 六 2 二 十 迪 js 人) 这 pc =0, oo 关 0 





了 2 产 3( 人 太守) 机 2 二 PT Na( 仿 2) 矿 DC) 天 0 
2 三 0 Jor 2Z&TN 


ZL 


wiiitialvalues xz(0) 三 OO) Jor2 和 LS< 0 thejinction 


zz 的 一 8 四 十 》 6 ze 人 (3.26) 
Sais1ies 伦 侣 
开 开 
>》 ojizlt+J = 有 >》 Bcl 和) + OPT 3 (3.27) 
5J=0 J=0 


For 号 一 01jedterential eguation Jory zs tpe sa11re das 胡 (3.9) 7T1e solutions of 
(3.25) sa1is 放 zz 三 丈 ( 轨 wjpeneyverGe = 人 aad1Jis relation jpolads Jor tpe iitial 
Values. MoyeoyeF zz( 轨 = O(PmY+2) om pox1aded ine intervalsy 太 Go ia product oF 
10 jewer tpanmm 之 27oox orp(0). 
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Proojf In complete analogy to the proof of Theorem 3.3 we obtain 


p(Ceei)ze = jh2co(Cee2D) 2 >》， FU) (zo 20) (3.28) 


mrZ0 《el Gem 一 Ge 


which differs from (3.21) only in the factor 12. Depending on whether Ce is 
a _ double,，a Simple，or not a zero of p(G)，we expand 7z2c(Geez)/p(GCeez) or 
2Z0(Geez)/p(Gez) or al(Geez)/p(Gez) into aseries of powers of z, and we denote 
its coefficients by /2, 刻 This then yields 


.， 工 
关 = (Upo+HaRD+ 六 /0m 殷 (zato)，G.29) 
mm0 Cel Cem 一 Ce 


这 p(Ce) = 0(Co) = 0,but 0 (Co 和 0(Gnparticular forZ=1andGl = 了)， 
工 
坊 二 pa(ueo+HaanD+…) 定 志 守 7 国 (zzeo)，G30) 


mmZ>0 ”Ge Cem 一 Ge 


io(Go) = 0, but mw(Co) 关 0,and 


妈 一 12(peo+HeaAD 二 >》， 本 >》 Jo(J)(zo zem)，G3.31) 


mmZ0 6666 一 Ce 














过 p(Go) 夭 0. Therest of the proof is identical to that of Theorem 3.5. 


For the system of modified equations (3.25) we need initial values V(0),V(0)， 
2(0),2(0) 让 Ge is a double root of p(C), and ze(0) 这 Ce is a simple root. These 
injtial values can be obtained from the starting approximations Vo,...;,Vk_1YViathe 
Telation (3.24). 


Lemma 3.7. Co7sider a SiGD1e，Sy11111etric 11IUUtistep 111etpod (1.8) of order 7 ad 
jet 太 e Sia11i118 GDPP1oxinaa1io1s y0) .UK_1 Sa1S 记 帮 一 Wez(T1 三 OO) wii 
2 拉 s 挟 7 十 2.71emn 1jere exist (localy) 11GuUe IIUGL vaLUles Jor 1Pe yysfe11 (3.23) 
SUCP 1Pa1 IN SOLULOT exXactiy Sa1is1jiesy (3 了 .24). 

Tjese initial values satis 太 xp(0) = 矶 (0) 矿 C = GT aad 


(0) E yez(0) 二 OU )， A0(0) APVez(0) 二 OU )， 
0)=O(1)，P2z(0) = O(RS) 矿 Ce ma dorvplie oob (3.32) 
0) = O(1)， 矿 Ci asSi1aple 1oo 


Pioof We scale the derivatives by 几 , and consider Vy(0), V(0), ze(0) and PRze(0) as 
Unknowns in the System (3.24), where V(b and zz 人 (为 are a solution of (3.25). For 
由 二 0alinear confluent Vandermonde System is obtained. Since this ls an invertible 
matrix, the Implicit Function Theorem proves the Statement. 
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XV.4 Can Multistep Methods be Symplectic'， 


Readers might be astonished to fnd a question mark in the title. The reason is that 
we Shall present two definitions of symplecticity of multistep methods applied to a 
Hamiltonian System 


六 = 一 瑟 v(D, 9)， d= 万)(D,9). (4.1) 


One works in the phase Space of the exact fow, the other in a higher dimensional 
Space. But which one is suitabley” We further show that certain multistep methods 
can preserve energy over long times, even 让 they are not Symplectic. 


XV.4.1 Non-Symplecticity of the Underlying One-Step Method 
A conjecture due to Feng Kang. (Y.-EFE. Tang 1993) 


Anatural deftinition of symplecticity consists of the requirement that the underlying 
one-step method (TIheorem 2.4) be symplectic. This means that the (truncated) mod- 
iftied equation (3.1) is Hamiltonian. Unfortunately, we have the following negative 
TeSult. 


Theorem 4.1 (Iang 1993). 7T7Pe xnderiying one-step 11etjpod or a co1msisteP1 1170ea7 
JIUL11sSteD 11e 太 od (7.7) ca11zot pe Sy11DUectic. 


Proojf We Show that the first perturbation term in the modiftied equation (3.1) is in 
general not Hamiltonian. From (3.4) we know that 广 41(V) = Mr(CDIT 访 (y) which 
(omitting the non-Zero error constant /Jr) l$ given by 


和 arFPog = 和 一 Mr)F (Cd92) 


at7T) 
7TE7T ,|7| 王 7 十 1 7TE7 ,|7| 王 "十 1 


with bg(7) = 1/7y(r) for |r| = 了 十 1 Oheorem IIUI.1.3 and (IIU.1.27)). Suppose 
now that (4.2) is Hamiltonian for all separable Hamiltonian vector fields jyV) 三 
J1LV 万 ( 力 . Theorem IX.10.4 then implies 


ol(uov) 十 puoz) 一 0 foral ve with lv 十 |v| = 十 |. 


This, however ls in contradiction with 


工 工 工 工 


Toy yo 可 TO TO 


which is a consequence of Theorem VI.7.6 (because the exact Solution ls a Symplec- 
tic transformation and, as a B-series, has coefficients w(7) = 1/7(7)). 

















A similar negative result holds for a much larger class of integration methods. 
For example, it is proved by Hairer 儿 Leone (1998) that, among the class of one- 
leg methods (See (4.7) below), only the implicit mid-point rule ls Symplectic (in the 
Sense that the underlying one-step method is Symplectic). 
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了 Partitioned Linear Multistep Methods. We know at least one Symplectic method 
of the form (1.14). It is the Symplectic Euler method (VI.3.1), which combines the 
implicit and the explicit Euler methods. However we do not have better within the 
class of partitioned multistep methods as is Shown in the next theorem. 


Theorem 4.2. 矿 1Pe Under1yinzg one-sfep 11etpod oa co1zsiste11 PaFHiLiOed 1172ear 
11IUL118S1eD 111e1pod (7.74) 18 Sy11DUectic Jor G1 separaplie 万 a11ltliomia1z 8SySte11S，17e1 
is o1der Satisjiey 7 苹 1. 


Proojf Suppose that the order of the method is ” > 2. By (3.8), the dominant per- 
turbation term in the modified differential equation is MO2r(DY 诱 (y,z) for the 


V-component and HB)jr(DI 9g)(y, 2z) for the z-component (at least one of the co- 


efficients 1 and HL) is non-zero). This is a P-series with coefficients (7) 三 


HA 人 7) 让 7 E 7TP,Ir|=7 十 1andb(7) 三 AHB) /my(r) 证 7 E 7TP,|r| = 了 十 |. 
Hthe underlying one-step method is Symplectic (ie., the modified differential equa- 
tion is locally Hamiltonian), Theorem IX.10.4 implies that 


b(uoo)+buo)=0 for ETPvcsTP,JU+lvl=7+1 (4.3) 


Taking forw Eee 77 thetree with one vertex, andforv E 77o an arbitrary tree with 
lu| = ”, condition (4.3) gives the first relation of 
4 妃 妃 4 
大 和 Lor 由 有 


Fr TOO TH TCD 一 


Exchanging the colour of the vertices gives the Second relation. This contradicts OUT 
asSumption7 之 2. 




















开 we restrict our considerations to Hamiltonian Systems with 


业 
五 2,g = 5 Cp+cD+U(9)， (4.4) 
where the kinetic energy jls at most quadratic in D，we can find Symplectic, parti- 
tioned multistep methods of order two. Indeed, the combination of the trapezoidal 


rule with the explicit midpoint rule 


必 
Dnp+l 一 Dmn 一 了 (Y5los) 本 VU(w)) ) dmn+lL 一 4 一 1 一 27(Cpn 求 c) (4.5) 


has the Stormer-Verlet method as underlying one-step method. This is Seen as fol- 
lows: Since the Hamiltonian is Separable, formula (VI.3.4) yields the first formula 
of (4.3). The second relation ls a consequence of an+1 一 an 十 ALCCPn+172 十 c) and 
Dn+l/2 十 Dn_1/2 一 2Dn, and uses the linearity of 五 p(P, 9). 

Also for this special class of Hamiltonian Systems we cannot achieve high order 
and Symplecticity at the Same time. 


Theorem 4.3. 太 1je nder1ying one-sfep 11etpod oa co1zsisfe11 PaQFHiLiOed 1170ea7 
11I2UL118Step 111etjod (2.7 和 1 Sy1Dlectic Jor al Famazliomia1z syste1118 Wit 已 G11a1zl1o12ia1 
or1jpe jormmza (4 也 1pen it ordersatisjes 了 苹 2. 
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Prooj The beginning is the Same as that for Theorem 4.2. We let7 > 2 be the order 
of the method (A) so that 1 和 0. Instead of (4.3) we now have to use 


bl(woov) 一 pvooz) 一 0 for VE TN， 十 三 凡 (4.0) 


which also follows from Theorem IX.10.4. Taking forv E 7Nop the tree with one 
vertex, and forv E 7TNp an arbitrary tree with |v| = 了 一 1 condition (4.6) gives the 
relation 

pr- 有 
2+DYo) rr+TD7O) 


which is contradictory for7 > 2, because 0) 天 :让 

















Remark 4.4.， We believe that the statement of Theorem 4.3 remains true, 让 we 
restrict OUr consideration to Hamiltonian functions (4.4) with c = 0 and invertible 
matrix C. Since multistep methods (1.8) for second order differential equations can 
be converted into partitioned multistep methods, this then implies that methods (1.8) 
cannot be Symplectic unless the order Satisfies7 所 2. 


XV.4.2 Symplecticity in the Higher-Dimensional Phase Space 


We present here a Second approach for the deftinition of Symplecticity of multistep 
methods (more precisely, of one-leg methods). It is much inspired by the C-stability 
theory of Dahlquist (1975) for the study of stiff differential equations. 

To Simplify the nonlinear Stability theory of linear multistep methods (1.1)， 
Dahlquist (1973) introduced the So-called ome-leg 1etpods, which are defined by 


the relation 
大 开 
2 =( Bat (4.7) 
J=0 J=0 


Where the normalization ac(l) 一 227 Dj = 1is assumed, In fact there is a close 
relationship between the numerical solution of (4.7) and (1.1), and their long-time 
behaviour is the Same (See Sect.V.0 of Hairer 贸 Wanner,， 1990). In the following 
We consider the Super-vectors 到 一 (yn 1 2) 工 collecting 8 consecutive 
approximations of the solution. 


Definition 4.$. Let G be an invertible symmetric matrix of dimension K. A_K-Step 
multistep or one-leg method is called C-sy1azPlectic 让 


ZTCGQ@9 DT=2(G@59)Z， (4.8) 


whenever the differential equation 7 = /yy) has y7.Sy as invariant (with symmetric 
3), ie., the vector field satisfies y72 .SO = 0for all y. 


It is of course also possible to express this definition in terms of differential 
forms. As a consequence of Lemma VI.4.1 the conservation of quadratic first inte- 
grals is equivalent to Symplecticity (Bochev 儿 Scovel 1994). 

In contrast to the negative results of Sect.XV.4.1，there exist a lot of C- 
Symplectic methods. We have the following result. 
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Theorem 4.6 (Eirola & Sanz-Serna 1992). 尼 very 17r7reduciple 8y11z111etric oO11e-leg 
Jet1jpod (4.7) 18 G-Sy111D1ectic Jor so111e 1IQ11iX @. 


Proof We recall that a one-leg method is irreducible 过 the generating polynomials 
0(0) and (5C) have no common zeros. 

Coms1truction of G. The symmetry relation (1.3) implies p(1/C) = 一 Cep(0) 
and ol(1/6) = 64c(O). Consequently, the polynomial p(C@Jc(w) + p(w)c(G) van- 
ishes for w = 1/G, and contains the factor Cw 一 1. We then define CG by 


开 
(p(Oc(w) +polojc(O) = (Go-D >》 90561o5 (4.9) 


让 7=1 


MP 1 王 


The matrix C obtained in this way is Symmetric， 
Resguiaripy ofG. Applying the geometric Series we get 


达 
op6roj = -io(Oe+plojo(O)G+ 人 + 二， 


让 7=1 


where the identity holds as formal power series. Suppose that the matrix C is not 
invertible. Then there exists a vector 一 (uoyt ,WA_I)7 such that Gu 一 
0. We formally replace the appearances of w7-1 with wii for7 < 有 and with 
Zero for 7 > K. This gives an identity of the form 0 = p(G)a(G) 二 a(OCG) with 
polynomials w(C) and p(C) of degree at most 大 一 1, and we get a contradiction with 
the irreducibility of the method. 

G-Symaplecticity. We next replace in (4.9) Ciw7 with yiSyn+ 六 Together with 
(4.7) this yields 


开 天 
ji( po] sj( po 由 = 到 Ge-Cesm 
2 一 0 2 一 0 


oo 1).This proves (4.8) for all functions jy) satisfying 














了 xample 4.7， We consider the explicit midpoint rule (1.6), which is also a one-leg 
method, and the 3-step method (3.13). By Theorem 4.6 the one-leg versions are 
C-Symplectic. Following the constructive proof of this theorem we find 


0 1 1 
0 1 
c=(! 站 and G=|11 -2 1|， 


Tespectively. We apply both methods to two closely related Hamiltonian Systems， 
namely the pendulum equation with 瓦 (pz,q) = 22/2 -- cos dg and a perturbed prob- 
lem with 五 (pz,dq) = 22/2 -cosd(1 - P/6), and we study the preservation of the 
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Imlidpoint rule 
pendulum 


Imidpoint rule 
pert. problem 
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全 ， | 
0 500 1000 0 300 1000 


Fig. 4.1. Numerical Hamiltonian 妃 (pu, gm) 一 瓦 (po,qo) ofthe explicit mid-pointrule and the 
3-step method (3.13), applied with step Size 下 = 0.01 to the pendulum problem ( 万 (p,9) 三 
D2/2 一 cos dg) and to aperturbed problem ( 娓 (p,g) = D2/2 一 cosq(1 一 p/6)) on the interval 
[0, 1100] (only every 131st step is drawn) 


Hamiltonian (See Fig.4.1). The result is somewphat Surprising. The midpoint rule be- 
haves well for the perturbed problem, but shows a linear error growth in the Hamil- 
tonian for the pendulum problem. On the other Slide, the weakly stable 3-step method 
behaves well for the pendulum equation (which is in agreement with the stable be- 
haviour of Fig.3.1), but has an exponential error growth for the perturbed problem. 
Notice that different Scales are used in the four Pictures. 


The above example illustrates that C-Symplecticity of a numerical method is 
not sufficient for a good long-time behaviour. It is necesSsary to get under control the 
parasltic solution components. 


XV.4.3 Modified Hamiltonian of Multistep Methods 


After the negative results of Sect.XYV.4.1, we are fortunately also able to prove pos- 
itive results concerning the near conservation of the Hamiltonian. 


Theorem 4.8. For a sy1a111e11ic，co118iste1t 1112ea7 1IUUListep 11etpod (1.7) oforderr 
applied io = JIV 厅 (1Pere exisfg aseries of1Ppe jp 


刀 ( 轨 = 万 (十 大 丽人 十 旗本 +3( 仿 十 ， (4.10) 


WAicp is ajJorraal Jrst Integ1ral oftipe 11odijed equatio1 (3.7) Witpouxt 1ru1icatio1. 
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Pioof With pf(ez)/(za(ez)) = 工 十 困 27 十 困 ra22r12 十 .itfollows from (3.3) 
that the Solution of the modiftied differential equation Satisfies 


(十 图 1Dr THat2D2 上 + = JIVFJ)+OOAN)， (1D 


Where, due to the symmetry of the method, only odd derivatives of y( 妨 appear. We 
multiply both sides with g7 .7 so that the right-hand side becomes the total derivative 
有 芋 万 (四 . On the left-hand side we note 好 .JI = 0, 拒 7TV) = 半 ( 红 .JJ) and 
Similarly for higher derivatives 





Q 7 . 要 疯 坪 7 77 
抽 7g2rtD = 三 ( 打 7yem 一 负 Jyem 二 十) ， (12) 


We thus obtain atime derivative of an expression in which the appearing derivatives 
can be Substituted as functions of V via the modified differential equation (3.1). 
Altogether this yields 


Cd 和 7 十 2 二 dl JIV 
一 导 ( 克 机 Ha 的 十 术 茹 本 ta( 扫 十) = 天 瑟 人 二 Oh) 











which proves the statement. 





The statement of the previous theorem ls Somewhat Surprising. The underlying 
one-step method, although not Symplectic, nearly conserves the 了 Hamiltonian for 
general 达 (V) (not even reversibjlity is required). This indicates that the condition 
(X.9.20) can be satisfied for all trees also by non-Symplectic methods. 

For partitioned multistep methods we do not know of a similar result unless 下 
we restrict ouUr consideration to Hamiltonians of the form (4.4). In this case we are 
concerned with multistep methods for Second order differential equations. 


Theorem 4.9， For a sy1a111e11ic，co118isfe11t 1112eG7 11IUULisStep 11etpod (1.8) of order7 
aDplied jpo 了 = 一 VDU( tpere exis1s dyeries oftjpe Jo 

2 1 . 关 ， 六 。 入 ， 
豆 (9, 信 = 了 人 证 切 十 大 丽 H(0 人 十 天王 肌 Ha(0 仿 二， (4.13) 
WAicp is ajJornaal Jrst inateg1ral oripe 11odijed equatio1 (3.9) WitPouxt 1ru1icatio11. 


Prooj The proof is very Similar to that of the previous theorem. We expand 
p(ez)/(z2o(ez)) =1 工 + 力 十 方 HZ 十 .andsimilarto (3.10) we obtain 


(十 六 了 Dr 十 Wapr+2DrT2 二 .这 二 一 VD 二 OCO)， (4.14) 


This time we multiply both sides with 好 .The right-hand side becomes the total 
derivative 号 U7(), and for the left-hand side we use 好 了 = 半 ( 好 97) and forhigher 
even-order derivatives 





。 TD Q 。 7 一 局 ?7 一 工 7 77 
jzryem 一 (ye 二 3 区 )7/ ) (4.15) 


JIntegrating and Substituting Second and higher derivatives of V via the modified 
differential equation (3.9) yields the desired formal first integral close to the Hamil- 
tonian of the System. 
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The formal first integral (4.13) does not depend on how approximations to the 
derivativev = 9 are obtained. If the derivative at grid points ls numerically com- 
puted with the formula (3.11)，then one can Use the one-to-one Correspondence 
(3.12) to express the coefficient functions of the modified differential equation in 
terms of y and 


XYV.4.4 Modified Quadratic First Integrals 


Symplectic one-step methods exactly preserve quadratic first integrals (Sect.IV.2). 
This is not true forthe underlying one-step method of symmetric multistep methods. 
However as we Shall prove in this Section, it nearly preserves Such first integrals. 


Theorem 4.10. Lef Q(y) = CCy (wii0 asymaaretric maa1rix C) pe a st integral 
ory = jy). Forasyma11aetric co1lsiste1t 1i2ear 11tistep 1ietpod (7.1) oforder 7 
1jeye 加 e1l exis1s da Seies of 1je Jo 


Q@(= 氏 Cy 二 Q@rH( 轨 十 MT2Q@Hs( 妇 十. ， (4.10) 
WAicp is adjJornaal Jrst Integ1ral oripe 11odijed equation (3.7) Witpouxt 1ru1icatio11. 
Proof We multiply (4.11) with y7C und thus obtain 
到 CU 六 十 TD 十 六 王 到 Cj 上 +O(). 


Since VCyw is afirst integral, the term on the right-hand side vanishes. For the terms 


on the left-hand side we notice that VCV 一 二 呈 (07Cy) and that 





7T70 dl ITT0 和 ?7 一 1 we 2 
7CyCm+D 一 元 (7Cye ) -0TCy2m-D 十 ... 土 WOTCW 站 (4.17) 











As in the proofs of Sect.XV.4.3 we now deduce the Statement. 





Asimilar result holds for second order differential edquations and methods (1.8). 
This concerns for example the total angular momentum in N-body Systems. 


Theorem 4.11. Swuppose 1Pati= jy) pasZ 人 三 IEBVasjzrstiniegral ie 
1 SKew-sy1z111etric ad 07 瑟 Fo) = 0. Forasynma1aetric comsistenf 1inear altisfep 
11e1jod (1.8) of ojderm 1pere 1je1 exis1s da series of ipe Jo111 


也 (信用 三 氏 本 十 杂念 十 人 Ts( 轨 信 十 …， (4.18) 
WAicp is ajJoraal Jrst Integ1ral ortipe 11odijed equatio1 (3.9) WitPouxt 1ru1icatio1. 
Proof Multiplying (4.14) with ?已 gives 
多 于 和 放 了 有 十 和 1T2Dr 十 六 = 有 瑟 FJ) 上 +O()， 


The term at the right vanishes. Since 已 js a Skew-Symmetric matrix, we have for the 
termas to the left that y 工 甩 7 一 全 BY and that 


d 和 0 
多 Bem 雪 一天 (区 er 一 拉 Dyem 十 十 Bgynt) 人 419) 














This yields the statement as in the previous prootfs. 
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Remark 4.12. Noticing that the underlying one-step method of a symmetric mul- 
tistep method can be expressed as a formal B-series (cf. Sect.XV.2.2), it follows 
from (4.17) that the modified first integral of Theorem 4.10 is of the form (VI.8.0). 
By Theorem VI.8.3 the underlying one-step method is therefore conjugate to a Sym- 
plectic integrator. 

A Similar result holds for symmetric methods (1.8) complemented with a Sym- 
metric derivative approximation (3.11). The variables v and y are related via (3.12) 
having an expansion in even powers of /. Substituting y = V(Vv) of this relation 
into the modifed first integral (4.18), we obtain an expression of the form (VI.8.11). 
Here, the elementary differentials correspond to the system7 = 之 = JW) (has 
to be identified with 2z). Theorem VI.8.8 combined with Theorem 4.11 proves that 
the underlying one-step method is conjugate to a Symplectic integrator. 


XYV.S Long-TIerm stability 


The results of Sects.XV.4.3 and XV.4.4 imply the near conservation of the total 
energy and of the angular momentum in N-body problems for numerical solutions 
of the underlying one-step method of multistep methods. This, however is of no 
value as long as the parasitic solutions of the mujltistep method are not under control. 
The Present Section ls devoted to the study of the Stability of numerical Solutions 
over long time intervals. 


XV.S.1 Role of Growth Parameters 


The analysis of this section ls based on the representation 
名 =W(n 癌 十 >，C2 ze(nh (G5.D 
LET* 
of the numerical solution of a multistep method (cf. formula (3.16)). 


Linear Multistep Methods for First Order Equations. By Theorem 3.5 the par- 
asitic components ze (for2 和 上 L< 和 jarethesolution ofadifferential equation 
which, by (3.22), is of the form 


和 三 He ze 十 : (5.2) 
This is just the variational equation of y = jy) scaled by 
cce) 05.3) 





0 


Which is the So-called grow 态 parazaeter as introduced by Dahlquist (1939) and 
motivated there by a linear stability analysis (See Exercise 93). 

We shall illustrate at the examples of Sect.XYV.3.2 that the study of the truncated 
equation (3.2) gives already a lot of insight into the long-time behaviour of multistep 
methods. 
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凡 二 二 0.3 
2 350 0 To 00 50 
-二 





Fig. S$.1. First component of the solution of the pendulum equation (grey) together with the 
Euclidean norm of the solution v(t of the scaled variational equation (S.4) 


卫 xample $.1， For the pendulum equation, the truncated equation (3.2) ls 


人 1 22 V1 0 1 VU1 
(的 (全 (二 en 可 人 6 
We fix initial values as y(0) = (1.9,0.4)7 and uv(0) = (0.1,0.1)7. Figure 5.1 shows 
the solution component Wi(t) in grey, and the Euclidean norm of v(t as solid black 
line, once for / = 一 1 and once for / = 0.5. We notice that the function v( 娘 
remains Small and bounded for/ = 一 0.5, and that it increases linearly for / = 一 工 . 

This agrees perfectly with the observations of Figs.3.1 and 3.2，because the 
method (3.13) has growth parameter /1 三 一 0.5 for the roots Ge = 土 ?, whereas the 
explicit midpoint rule (3.14) has 1 = 一 1L for Ge = 一 |. 





The same analysis for partitioned multistep methods allows one to better un- 
derstand the behaviour of the different methods in Fig. 1.3. The leading term of the 
parasitic modified equations depends on whether Ce is a root of both polynomials 
04(0) and ppB(G), or only of one of them. This is very similar to the Situation en- 
countered with multistep methods for second order differential edquations which we 
treat next. 


Linear Multistep Methods for Second Order 上 Equations. Theorem 3.6 tells us 
that the modified equation for the parasitic components ze depends on the multiplic- 
ity of the root 6&. Consider a stable, symmetric method (1.8) forY= .ICceisa 
double root of p(6C), formula (3.29) yields 


2c(ce) 
加 二 Hp 疡 ( 轨 z 十 .…， HE = 姜 - 人 ， (3.5) 
GO0(Gg) 
Where we have not written terms containing at least two factors zj 玉 Ge is a single 
Toot of p(C), we get from (3.30) that 


ao 
1 Ge (Ge) 


There is an enormous difference between the parasitic modified equations corre- 
Sponding to double or single roots of p(6). Equation ($.S) is the complete analogue 





2 三 je 天 (yy)ze 利和 (3.0) 
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of (9.2) and, as before, the long-time behaviour is hardly predictable and strongly 
depends on the growth parameter. For Single roots, however we are concerned with 
a first order differential equation (3.6) having an additional factor 尺 as bonus. For 
the analysis of Sects.XYV.3.2 and XV.3.3 it is Important to have only Single roots. 


Definition S$.2. A symmetric multistep method (1.8) for second order differential 
equations is called -staple 记 apart from the double root at 1, all zeros of p(C) are 
Simple and of modulus one. 


The linearized parasitic modified edquations give much insight into the long-ttme 
behaviour of multistep methods. To get rigorous estimes over long times, however， 
further considerations are necessary. A partial result is given by Cano 狠 SanZ-9erna 
(1998) for multistep methods (1.8) applied to equations y = jy) with periodic 
exact Solution. There, the first terms of the asymptotic error expansion for the global 
error are computed, and their growth as afunction oftime is studied. We shall follow 
the approach of Hairer 儿 Lubich (2004) who exploit the Hamiltonian Structure of 
Second order differential equations. 


XYV.S.2 Hamiltonian of the Full Modified System 


In the remainder of this section we restrict ouUr conslderation to S-stable, 1irreducible 
linear multistep methods 


开 开 
>》 aijgH = 一 妇 》 DiVU(o 让 ， G.7) 
J=0 


9=0 
applied to Hamiltonian Systems written as 
4= 一 VV(9)， (5.8) 


where U(d) is assumed to be real-analytic in the considered region. 

The key to proving long-time error estimates ls the observation that much of the 
Hamiltonian structure is conserved in the modified equations (3.25). The results and 
techniques of this Subsection are closely related to those of Sect. XIHI.6.3 developed 
for numerical methods for oscillatory differential equations. 

Weletz = (2)vezw and define 4(z) as 


x@= To+ 和 记 >》 TU)(zoj(za zeon)， (0.9) 


mmZ1 《ecem=1 


Where the second sum is overallindices 如 ETN,， .tn EZN WihGe .6 三 工 
(using the notation of Sect. XV.3.2). Since the roots of p(G), different from 61 = 1 
are complex and appear in pairs (Exercise 3), also the functions ze appear in pairs. 
It is convenient to use the notation 2 三 2 让 Ce 一 ( 

It follows from (3.28) with /dg) = VV(q) that every solution of the truncated 
modifed equation (3.2S) Satisfies 
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p(Ge)z = 一 io(Gee)Y。 WU 十 OO (5.10) 
人 (forallle27Z)aslong as 
VE 天 SSM，lzelsotor1l<t<R， (5.11) 


where 开 is a compact Subset of the domain of analyticity of V(q)，M > 0 some 
bound on the derivative, and0 < 0 = O(jP) is a sufficiently Small constant (note 
that this implies | zol 和 oforalle 公 证 Pissufficiently small, cf the algebraic 
Telations of (3.23)). 

For ease of presentation，we assume for the moment that ol(Ce) 夭 0 for all 
2EZN(weknow thatthis holds for1l < 上 < K,because the method is irreducible). 
We apply the operator ac-1(CeeP) to both sides of (5.10) and divide by 72: 


h 2 人 () (CehD)z = ~-V。 ,2 人 上 ON (5.12) 
We then multiply with 3 江 2 Sum over al LE ZN, and thus obtain 
能 二 巡 ， 光 儿 - )(ceexp)ze+ 亏 4 ztz)=OU) (5.13) 
ZEZN 屁 


We now Show that also the first expression on the left-hand Side is a total derivative 
of a function depending on z and its time derivatives. For this we note that 


(eco) =》 "cojm withrealcoefficients coj = (-1ic_ oj， (G.14) 
IJ>0 


This holds because the symmetry of the multistep method yields (p/o)(1/G) = 三 


(p/aj(G) and hence, for real z, (p/oj(Gce) = (p/cj(Ces) = (Ooj(Cen 
With the expanslion (9$.14) we obtain 





AT 二 1 
(2 ) (Cee?2) )ze 一 2 ce 万 )7 z0) 二 OU(AN+2)， (5.15) 
IT 


To study (3.13) we apply the relation (4.12) for the real function y = 21 and for 
2e Corresponding to Ge = 一 1, while for the complex-valued functions z = ze,， with 
complex conjugate 达 = 2_ bi, We Use 





d 1 
Re 玉 zCm) = Re 元 外 D 一 六 zC2m-2) 十 .十 5 








Im 训 >zCm+D = Im [0 一 如 zC2m-D 十 .和 干 人 


Tosgether with (9$.15) these relations Show that the terms 


2 人 )(cexp)ze+ 弛 ke 
+1 
= 》 coj2Re(( 玉 372 多) 二 ON 
J=0 
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give atotal derivative (up to the remainder term). Hence the left-hand side of (5.13) 
can be written as the time derivative of a function which depends on ze, 上 EZN,and 
on their derivatives. Using the modified edquation (3.25) we eliminate all ze corre- 
Sponding to Ce with po(Ce) 夭 0 and their derivatives, the first and higher derivatives 
of ze (for1 < 上 < ,and the second and higher derivatives ofy 三 21. We thus get 
afunction 


(0 芒 ) = 五 0( 人 (思念 2) 十 .十 1 有 N_1( 汶 2) (5.16) 
with z* 一 (ze)4J ,such that 


Q 

dt 
along Solutions of (3.23) that Stay in a set defined by (3.11). The function 刀 ls 
therefore an almost-invariant of the System (3.23). 


Tt 的 天 提 ) = OU )， (5.17) 


If however oa(C) does have a zero Ce, then we omit the corresponding term from 
the sum in (3$.13). Hence the term 并。,UW(z) is missing from (qd/dbM(z) and 
must therefore be compensated in the remainder term. Since Ge ls a product of no 
fewer than two zeros of p(G), it follows from (3.31) and from Heo = 0 that ze 三 
O(P362),aslongas|zj 和 oforl<7y<K. WefurtherhaveV。 ,MU(z) = O(02)， 
so that the remainder term in (5$.17) is augmented by O(736 人 4)， 

We Summarize the above considerations (Hairer 儿 Lublich 2004) as follows. 


Theorem 5$S.3. 尼 very SOLUtiOo1 or 1jpe 1runcated 1aodified equation (3.23) Satisjiey， 
Wi 太 放 Jozz (5.70) 


(VD 全 的 ) = OO)+OG TO G.18) 
GS 1o118 ds 1je SOULU1io1l 8S1ayS zI 11e sel dejijied py (5.77) Moreoyver 
(信念 2 于 ) = 万 ( 加 人 二 OP) 十 O(P6 (5.19) 


The closeness to the Hamiltonian 万 (y,2) = 引 z2 二 V(y) follows also di- 
rectly from the above construction. For z* = 0 we have (yy 0) 三 万 (yw 人， 
where 万 is the modified energy from Theorem 4.9. 

We will use Theorem 3.3 in Section XV.6.1 to infer the long-titme near-conSserVa- 
tion ofthe Hamiltonian along numerical solutions. Before that we need to bound the 
parasitic components. 


XV.S.3 Long-Time Bounds for Parasitic Solution Components 


The modified equations have further almost-invariants which are close to the Squares 
of the norms of the parasitic components that correspond to the roots of p(C). We 
derive them here and use them to show that all parasitic solution components remain 
Small over very long times. The techniques used in this Subsection are Similar to 
those in Sects.XII.6 and XIIUI.7. 
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We considerl withp1 < 2 < for which Ce is asiaple root of p(C) and of(Ce) 天 
0. The dominant term on the leftrhand slide of (3.12) 1s 一 cl.1t 一 3， Since 


Cd 2 
天 ||zelP = ze2e 十 2 守 0 (5.20) 


we multiply (3.12) with 2 and the corresponding equation for Ce with 2 and 
we form the difference,， so that the dominant term on the left-hand Side becomes 
一 cei 一 呈 |zz2 note c_eil = 一 co1). Dividing by 一 col2j1 gives 





2 D DO 
(2 Seespjz 一 20Ceeso)z 

全 人 记 (zzev。 2 人) 十 28V22 人 四) 
Ci1 


(5.21) 


We first estimate the right-hand expression. Since 
V。,U(z) = V2U(zo)z 十 O(52)， 


as long as (3.11) ls Satisfed， we obtain from the Symmetry of the Hessian that the 
right-hand side of (5.21) is of size O()53). The dominant O(A63) term is present 
only 让 6C_e can be written as the product of two roots of p(GC) other than 1. Ithis is 
not the case, the expression ($.21) is of Size O(A64). 

Using the expansion (5.19) on the left-hand Side of (5.21) and the relations (for 
乞 三 20) 





Rez7z(2m+l 一 Re 人 一 广 z(2m-0 ... 干 人 


工 
2 


Im 也 人 大 训 >(2m) RS | 
dt 





(2m 十 2) 





ImzZlx 


we obtain that ($.21) is, up to O(A ), the total derivative of a function depending 
onz and its derivatives. 

By construction the dominant term is 羡 ||ze||>. The following terms have at least 
one more power of 尹 and at least one derivative which by (3.23) gives rise to an 
additional factor 凡 . Eliminating higpher derivatives with the help of (3.23), we arrive 
at afunction of the form 


Ce( 细 仿 于) 一 | 要 十 要 Kea( 沪 六 2) 十 十 je 1( 信 芒 天 )，(5.22) 


As we have seem its total derivative is of size O(h63) or smaller We summarize 
these considerations in the following theorem. 


Theorem 5S.4. 41ozg every solution or 1Pe 1ruacated 1iodified equation (3.25) 1Pe 
Jaction Ke( 22) satisjieysJor1<12O < 


Ce 的) 开罗) = eV0) 0 人 (0) 十 OU) 十 OU) (0.23) 
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GS 1o18 daS 1je SoOU1U1io1l StayS zl 11e Set dejied py (5.771) T1e seco1d error 1e1111 1 
1eplaced py O( 态 54 ifrno Joofofp(C) ofpertpanl1Tiztpeproductorpnvo other7ootfy. 
Moyeover 


Ke( 信 轨 于) 王 | ze 有 十 O(2262)， (5.24) 


This result allows us to write the numerical solution in a form that is Sulitaple for 
deriving long-time error estimates. Let us first collect the necesSary asSumptions: 


(Al) the multistep method (9.7) is Symmetric, s-Stable, and of order 7; 

(A2) the potential function V(da) of ($.8) is deftined and analytic in an open neigh- 
bourhood of a compact set 玉 ; 

(A3) the starting approximations q0,...,9qk_l are Such that the initial values for 
(3.25) obtained from Lemma 3.7 satisfy Vy(0) E 开 , 7(0) 和 M，and 
llze(0) 和 6/2forl1< < 局 

(A4) the numerical solution {dqn} stays for 0 < 7 几 科 全 in acompact set Ko which 
has a positive distance to the boundary of 开 . 


Theorem S.S (Hairer & Lubich 2004). 4sszmze (47) 广 (44) Fr SuiciemtDy S12G11 
刀 a10 6 and Jpr a jixed 1ruacation index N (iarge eolg1 sucp 1pafjny 一 O(54)) 
1pee exist jiactiozs yt aad ze 人 om a1 interval ofeng 纺 


了 =O(o) ) 


SUCAP 1Pat 


e dn 一 V(7 几 ) 十 》:， GO20(01) Jor0 芝 ?由 挟 了; 
EL 
@ O11 eye1y SUDipierval [JR (7 十 1) 记 ) 1Pe pctions yz 人 are asyoiution ortpe 
Syste11 (3 了 .23 
@ 1jpe jizactionsy zy zz 的 pave jaaip discomliauities ofsize ONT+T2) at tpe grid 
Doizatsy 7 
e | 2 区 | 和 56 Jor0<t<T 


太 1po 1oof ofp(C) othertpan Titpeproductorpno other Joofy al tjese estinzates 
C1e Valid om az interval of leng 态 全 一 O((RO2) 一). 


Pioof To define the functions y( 加 ,ze( 区 on the interval [7 (7 十 1) 刀 ) we consider 
the K consecutive numerical solution values dj 9j+1) .gj 1. We compute ini- 
tial values for (3.25) according to Lemma 3.7, and we let y( 旭 ,zz( 坟 beasolution of 
(3.25) on [77, (7 十 1]) 站 .Because their defectis OU ) and O(A+1),respectively， 
such a construction yields jump discontinuities of size O(jvX+2) at the grid points. 

Itfollows from Theorem 5.4 that Ke(V 人 ;72() remains constant up to an 
error of Size O(]263) on the interval [7, (7 十 T)j. Taking into account the jump 
discontinulties, we find that 


Ke 人 (8 圾 ,于 提 ) < Ce(Cy(0) 0)z(0)) 十 Citpo3 二 Coth>r 1 (5.25) 


as long as || ze 人 (加 | < 6. By (5.24) this then implies 
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| ze 人 2 < ze(0) 上 十 Crtp53 二 Cotpy+l 十 C37252. (5.26) 


The assumption || ze( 归 | 科 0 is certainly satisfied as long as C1tP6 苹 1 /4， 
Cothv+l < 62/4, and Csjhi2 < 1/4, so that the right-hand side of (5$.26) is 
bounded by 602. This proves not only the estimate for || ze( 昌 | but at the same time 
it guarantees Tecursively that the above construction of the functions y( 旭 ,ze( 切 i 
feasible. 














Notice that for initial values computed by a suffticiently accurate one-Sstep 
method the constant 0 can be chosen as small as O(jr+2) where r is the order 
of the multistep method (cf. Lemma 3.7). The above estimates are therefore valid 
on very long time intervals. 


卫 xample S$.0. TIo illustrate the long-time behaviour of the parasitic terms 2 We 
consider the pendulum equation g = 一 Sin q, and we apply the Symmetric multistep 
methods (B) and (C) of Example 1.2. For method (C), the starting values are chosen 
far from a smooth solution， so that the propagation of the Parasitic terms in the 
numerical Solution can be better observed. We compnute the velocity approximation 


一 了 (sw dn-1) 一 (9n+2 oa)) (5.27) 


























Fig. S$.2. Stable propagation of perturbations in the starting values for method (C) of Example 
1.2; Initial values are oo = 1.141 al = 1.158, 9q2 = 1.178, and as 王 1.206 























Fig. S$.3. Unstable propagation of perturbations in the starting values, for method (B) of Ex- 
ample 1.2; initial values are go 三 1.147 9ql = 1.183, q2 = 1.255, and 9qs 一 1.286 
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Figure 5$.2 shows the numerical solution (an,vn) form > 2. The values for 7 三 
2;,3,4,5 are indicated by larger black bullets. The parasitic roots of method (C) are 
士 and both are Simple. The numerical solution ls therefore of the form 


dn 三 WP 十 各 2(7 太 ) 十 (一 人 ”21(m7) 十 (一 ]"z2(7). 


One observes in Fig. 5.2 that the functions 2 ( 力 not only remain bounded and small， 
but they stay essentially constant over the consldered interval. This Should be com- 
pared to Fig.3.1, where the parasitic functions zj ( 妈 are bounded, but not constant. 

Method (B) has a double parasitic root at 一 1 and, therefore, is not s-Stable. 
Its numerical solution behaves like qn 三 V( 彤 ) 十 (一 1)7z(m 几 ). In Fig.S.3 every 
Second approximation is drawn in grey. One Sees that the numerical Solution Stays 
on two Smooth curves Vy( 坟 十 zand yy 的 一 2 罗 which, however do not remain 
close to each other. 





XYV.6 上 xplanation ofthe Long-Time Behaviour 


The bounds on the Parasitic Solution components of Sect.XV.3.3 allow us to get Tig- 
orous Statements on the long-time behaviour of multistep methods (3.7) for Second 
order differential equations. The following results are taken from Hairer 儿 Lubich 
(2004). We do not know of Similar results for multistep methods (1.1). 


XYV.0.1 Conservation of Energy and Angular Momentum 


The energy conservation ls now a direct consequence of Theorems 35.3 and 3.3. We 
Shall use the representation of qn in terms of functions Vy( 轨 ,ze( 为 as in Theorem 5.5. 
Taking into account the jump discontinuities of these functions, Theorem 5.3 yields 


ID 2 的 到 的 ) = 30) (0) 二 OU) 二 OU) 


Wehave0 = O(nr+l) 证 the starting approximations are computed by a rth order 
one-step method. 玉 N is chosen sufficiently large, this together with (3.19) implies 


琴 80) = 五 (0)00O)+OUP) for 0<<T7T 一 OO 人)， 


Hthe velocity approximation pn 王 vn ls given by a rth order finite difference 
formula (3.11), it follows from Theorem 5.5 that pn = V(2P) 十 O(j7) provided 
the truncation index N is sufhiciently large. This proves the following result, and 
explains the excellent long-time behaviour of method (C) in Fig. 1.2. 


Theorem 6.1 (Total Energy). For a proplem d = -VDU(d) wiipn total enerey 
万 (p,d9) 三 3D77D 十 (dg)， te 1aarerical solution or am s-s1GDle Sy1a11etric 1I21- 
11step 1Ie1jpod (3.7) of order7 Satisjies 


五 (qu pn) 一 五 (qo, Do) 冰 O(A) Jor 7 已 扩 厅 


太 1po Joof of p(G) other tpan1Tiaproduct ortno otper yoofy, ipe statemae1t ol1ds 
o1 infervalsy of enrg 态 O(] 23) 
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We assume next that the differential equation 4 = --VU(9q) has a quadratic 
first integral of the form Z(doo) = 民 49 (e.g.,the angular momentum in N-body 
problems). This means that 4 is skew-symmetric and VU(dq749 = 0. The last 
equation can also be interpreted as the invariance relation U(er4d) = U(d). This 
property implies for UL(z)，given by (5$.9), that M(er4z) = W(z) (here er4z 一 
(er4ze)jeezr). Along solutions z(b of the modified equations (5.10) we therefore 
have up to terms of size O(A) 

q 7TA4 全 一 2 > 了 必 刀 
0 2(e Z) 一 >》 2z74V。 ,WU(zJ)= > ze4(4)(Cee )ze. 


dd7 =- 
二 LE 工 LEI 


Ice(C) has aroot Ge, then the corresponding term is omitted from the last sum, lead- 
ing to aremainder term which in the worst case is O(1364), as in Theorem 5.3. Like 
in the previous proofs, the last sum is, for skew-symmetric 4, the total derivative of 
afunction 


(信芳 末 ) = To( 信 人 2) 十 十 内 NI( 沪 仿 玫 ) 
which satisfies (under the Same asSsumptions as in Theorem 5.3) 
CD ID, 的) = CO) .90),a(0)) +OGRa6 + OURNT) 


and 
必 仿 2) 三 工 ( 轨 人 态 十 OP) 十 O(02/ 六. (6.1) 


We therefore obtain the following result. 


Theorem 6.2 (Angular Momentum). Zef ZL(qo) = 民 49 pe ajJizrsf integral oF 
4 = 一 VU(q). THpe aaerical solution of an s-s1aple Sy11a11ietric 1IUistep 11ethod 
(3.ZJ) ofordery 1pen salisjies 


7(owpn) 王 Zoopo)+O) 放生 


矿 1po Joof ofp(C) otper ta lmaproduct ortpwo otper yoofy thpe state1aent polads 
o1 infervalsy of enrg 态 O(] -2 一 3). 














XV.0.2 Linear Error Growth for Integrable Systems 


The differential equation 4 = -VU(d), written as dd = UV = -VU(q)，is 
Ieversible with respect to the involution v 上 一 0. Assume that it is also an integrable 
System in the sense of Deftinition XI.1.1, and denote by a = 7T(a,u) the action 
variables, and by w(a) the frequences of the system. 

By Theorem 5.5, the numerical solution can be written as qn 三 V(mP) 十 
> vc7rs G22ze(0A)，wWwhere (at least locally) Vy(b is the solution of a modified dif- 
ferential equation (first equation of (3.25)) 


六 = 记 0(0 太 ) 十 态 j(0 信 2) 十 .十 jw-1(0 信 玫 ) (0.2) 
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which, for z” = 0 becomes the reversible modified differential equation (3.9). Since 
zj 人 一 O(O) (See Theorem 5.5) and since z* appears at least quadratically in (6.2)， 
this equation is a O(62) perturbation of (3.9). We are now in the position to apply 
the results of Lemma XI.2.1 and Theorem XI.3.1. The additional (non-reversible) 
perturbation of size O(62) in the differential equation (6.2) produces an error term 
of size O(t62) in the action variables and of size O( 妇 62) in the angle variables. 开 
0 一 O(1rf+li, these terms are negligible with respect to those already appearing in 
Theorem XI.3.1. The errors due to the jump discontinuities (Theorem 5.3) are also 
negligible. We have thus proved the following statement. 


Theorem 6.3， Co7zsider GDPL1ying 1pe 8-S1GD1e 8Sy111111etric 1IUListep 11etpod (5.7) of 
order7r io a1 jzteg1aple Jeyve1siple System 0 = 一 VU(O) wii 7eal-anaa1ytic pote1- 
HCL U.Suppose 1jatf wr E 形 d satisjiesy 1pe diopjpan1fine comdifion ( 筷 2.4) 77jem 
加 e1e eXist Dosifive co1181a11 CCc da11d Po SUcH 1at 1pe jliowmwing polds jor a11 
SteD Sizeg 几 区 凡 0: eyery 11U1ie1ical solution (qn Un) Sta1tn8g WiU Jeguencies 
wo 一 w(T(qo,zo)) sucn tpatllwo 一 wx < cllogjl- 一 2 satisjies 


en on) 一 (qu 证 
17 von) 二 7T(ao,vo)|| 


刀 CA 
Jor 0 芯 1=7 几 区 几 
Cj 











7T71e comstan1s Po,c,C depe1id oa dy,Z ad oz pox1ads or tiPe pote1atiaL. 





XV.7 _ Practical Considerations 


In computations with multistep methods one can observe resonance phenomena, 下 
relatively large step Sizes are used. This and the use of variable Step Sizes are the 
Subject of this section . 


XVYV.7.1 Numerical Instabilities and Resonances 


Soon after Quinlan and Tremaine's methods were published,， however， 
Alar Toomre discovered a disturbing feature of the methods, . . . 
(G.D. Quinlan 1999) 


It is a Simple task to derive multistep methods of high order. Consider for example， 
methods of the form (1.8) for second order differential equations y = jy). Their 
order is determined by the condition (1.9). We choose arbitrarily p(C) such that 
4 三 1isadouble zero and the stability condition is Satisfied. Condition (1.9) then 
gives 


cg =pO/ogs +O( 一 DT 站 


Expanding the right-hand expression into a Taylor series at 5 = 1 and truncating 
Suitably, this yields the corresponding o polynomial. If we take 


p(O=(C 一 1 十 6 十 人 十 全 十 1 (7.1) 
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Table 7.1. Symmetric multistep methods for second order problems; 天 二 8and order 了 一 8 



































SY8 9SY8B 9Y8C 
2 ai | 12096 Qi | 120960 ai | 8640 [ 
0 工 0 工 0 工 0 
1 | 一 2 17671 0 192481 | 一 1 13207 
2 2 | 一 23622 0 6582 0 一 8934 
3 | 一 ! 61449 | 一 172 816783 0 42873 
4 0 | 一 50516 一 ] | 一 156812 0 | 一 33812 














we get in this way Method SY8 of Table 7.1, a method proposed by Quinlan 
Tremaine (1990) for computations in celestial mechanics. All methods of Table 7.1 
are 8-Step methods, of order 8, and Symmetric, ij.e., the relations ai 三 Qk_i and 
Di = iaresatisfied. Therefore, we present the coefficients only for < /2. 
These methods give approximations yn to the solution of the differential equa- 
tion. Ialso derivative approximations are needed, we get them by finite differences， 
e.g., for the 8th order methods of Table 7.1 we use 
on 二 二 (672 (yn 一 yn-1) 一 168(yn+2 一 rn-2) 


(7.2) 
十 32 (yn+3 一 yn 3) 一 3 (yn 4 师 汉 | ， 


We apply this method to the 开 epler problem (LI.2.2), once with eccentricity e 一 0 
and once with e = 0.2, and initial values (L.2.11), Such that the period of the exact 
Solution ls 27. Starting approximations are computed accurately with a high order 
Runge-Kutta method. We apply Method SY8 with many different step Sizes Tanging 
from 2r/30to 2r/95, and weplot in Fig.7.1 the maximum error of the total energy 
as afunction of2r/ 尺 (where 到 denotes the step size). We see that in general the error 
decreases With the step size, but there is an extremely large error for 几 久 27r/60. 
For e 和 0, further peaks can be observed at integral multiples of 5 and 6. It is our 
aim to understand this behaviour. 


JInstabilities. We put zx = 91 十 2q2, So that the Kepler problem becomes 
疼 一 (zh)z，。， W(r) = 一 r 


and we choose initial values such that z( 吉 一 et is a circular motion (eccentricity 
e 王 0). The numerical solution of (1.8) ls therefore deftined by the relation 


天 开 
> oz 一 甩 >》， 记 凡 |znti)zn (7.3) 
J=0 J=0 
Approximating W(|2+j) with V(1) = 一 2，we get a linear recurrence relation 


with characteristic polynomial 


5S(og=pOTwic(9， 


004 XV. Dynamics of Multistep Methods 





























100 上 
method SY8 
10-3 
10- 
ee 一 0.2 
-9 
10 o -0 
Steps per period 
Eee 全 SEE 下 和 辣 | 和 Ti 下 下 和 让 25 富生 -和 
40 50 60 70 80 90 


Fig. 7.1. Maximum error in the total energy during the integration of 2500 orbits of the Kepler 
problem as a function of the number of steps per period 


The principal roots of S(wj 6) = 0 satisfy Ci(wj) se and Co(wP) 交 ee 
and we have |Gi(wPpj| = 1foral7y and for sufficiently small 九 ,because the method 
is Symmetric (Exercise 2). As a consequence of |Gi(wp)| = 1 the values 力 :一 
G1(wA” are not only a solution of the linear recurrence relation, but also of the 
nonlinear relation (7.3). Our aim is to Study the stability of this numerical Solution. 
We therefore consider a perturbed solution 


二 Gi(wh)" (1 寸 an) 


Using |zo| = 工 十 革 (un 十 到 ) 十 O(uwnl2) and neglecting the quadratic and higher 
order terms of jn| in the relation (7.3), we get 


玉 六 2 太 
>》 (axj 和 w2120j)G (wj)" un 一 可 俏 () >》， DiGLCwj)” (un Zn-. 
7 一 0 


JIJ 王 0 


Considering also the complex conjugate of this relation, and eliminating 立 n+7，wWe 
obtain a linear recurrence relation for wn with characteristic polynomial 


Shi Goho So Go 一 GO 二 OO (7.4) 


For small j, its zeros are close to Ci(w 四 -1G and Gi(wj)G. Iftwo of these zeros 
collapse, the O(j2) terms in (7.4) can produce a root of modulus larger than one， 
So that instability occurs. This is the case, 证 two roots 6Gi，GL of p(C) 三 0 satisfy 
GO 全 人 和 ee or 


47 
0 一 几 = 六 ， (7.5) 


where 6 = ez and 几 一 27r/V. 

For the Method SY8 of Table 7.1, the spurious zeros of p(C) have arguments 
47r/5, 士 2r/5, and 士 2r/6. With 0) = 2r/5 and 0 = 2r/6, the condition (7.5) 
gives N =60 as acandidate for instability. This explains the experiment of Fig.7.1 
fore = 0. Astudy of the stability of orbits with eccentricity e 夭 0 (see Quinlan 
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Fig. 7.2. Maximum error in the total energy during the integration of 2500 orbits of the Kepler 
problem as a function of the number of steps per period 


1999) Shows that instabilities can also occur when 4r/V is replaced with 2gr/N 
(gd 三 2,3,...)intherelation (7.5). 

To avoid these instabilities as far as possible, Quinlan (1999) constructed Sym- 
metric multistep methods，where the spurious roots of p(6C) = 0 are well Spread 
out on the unit circle and far from( = 1. As a result he proposes Method SY8B 
of Table 7.1. The Same experiment as above yields the results of Fig.7.2. The p- 
polynomial of Method SY8B is 


p(O=(C- 世 (+26+30+3.56 十 3 人 十 26 十 蔬 ， 
and the 0j of the spurious roots are 士 2r/2.278, 士 2r/3.353, and 士 2r/4.678. The 
condition (7.9) is Satisfied only for NM < 23.67, which implies that no instability 
occurs for e 一 0inthe region of the experiment of Fig.7.2. 


To illustrate the importance of high order methods, we included in Fig.7.2 the 
results of the Second order partitioned multistep method (1.153). 











XYV.7.2 了 上 xtension to Variable Step Sizes 


Variable Step Size multistep methods for second order differential equations V 一 
jyV) are of the form 


K 大 
>》， Qaj(PRn， 5 克 区 yn+7 一 7 >》， Li(Rn， nrTE-1) Jr)， 
J=0 JI 一 0 
Where the coefficients oj and Dj are allowed to depend on the Step Sizes jn .， 
jn _1, more precisely, on the ratios PT1/Pn ,An In 2.They yield 
approximations Wn to y( 如 ) on a variable grid given by 如 +1 一 如 十 jn Such a 
method is of order 7 (cf. formula (1.9)), 让 


天 天 
>》 oj( 人 (Po ait 一 有 1 >》 0j(n jntR 1) 共 押 - 
JJ 三 0 IT=0 

(7.0) 
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for all polynomials v( 芭 of degree < 了 十 1.Itis stapjie, 让 the p-polynomial with 
coefficients aj( 刀 ,..., 尹 ) (constant step size) Satisfies the stability condition of 
Sect.XV.1.2 (See Theorem IJIII.3.7 of Hairer Newrsett & Wanner (1993) and Cano 
信 Durin (2003a)). 

All methods of Sect.XV.7.1 can be extended to Symmetric, variable Step Size 
integrators. This has been discovered by Cano 多 Duran (2003b). For clarity of no- 
tation we let Q 放 记 (7 = 0,...,[) be the coefficients of Such a fxed step Size 
method. Cano 久 Durin propose putting 





An 
的 击 二 和 的 汪汪 一 六 Oh (7.7) 
也 十 天 一 工 
and to determine aj(jn ,An _1) Such that Symmetry and order 大 一 2 (for 


arbitrary Step Slzes) are achieved. We also Suppose (7.7)，but we determine the 
coefficients aj(jn ,AnHk_1) Such that (7.6) holds for all polynomials y(t of 
degree 冬 1. This uniquely determines these coefficients whenever PP > 0)...， 
ji_1 > 0(Vandermonde type System) and gives the following properties. 


Lemma 7.1. For eyem 及 /el (Qhi 启 ) dejije qd Sy1i111e1ric，S1GDIe K-sfeP 111etpod 
(1.8) of order 帮 a1nd co1msider 1ne yariaple step Size 1ietjpod give py (7.7) Ga11d 
oaj(jn yyPnHE_1) SUCR 1Pat(7.O) 1ol1ds jpor allpoiynommials y Sats 太 ing degy 苹 
K. 7T7118 11Ietjpod exie1ds 1Pe jixed step Size ja ie.， 


oj( 关 = 起 有 = (7.8) 
it Sa1is1iesy 11e 8Sy11111e11y 17eLG1IO1S 


全 开启 太 及 1 和 击 全 三江 矶 井 
1 要 Di1(Pn， 这 jnAK_1) 12 DPK 1 人 邦 站 )， 


G110 1 ofo1der 开 一 1 Jorarpitary step Sizey. Mo1eoveri 这 pehavesy Like da 711Ietjpod 
oforder 玉 矿 jrH = 加 ( 人 (二 O(a)) atoraly 于 郊 


(7.9) 


| 





Proof The relation (7.8) for Dj follows at once from (7.7), and for aj it is a conse- 
quence of the uniqueness of the solution of the linear System for the oo 

The second condition of (7.9) follows directly from (7.7) and from the Sym- 
metry of the underlying fxed step Size method (9 7 一 厅 for all 7). Inserting 
(7.7) into (7.6), replacing V(b with y(t 如 十 如一 胃 ,and reversing the order of 
有 ,An 1yields 


天 大 
有 
7=0 J=0 
Using 所 _ 记 this shows that ok_i(PnHk 1) .jpn) satisfies exactly the Same 
linear System as Qj(Pn .AnHk_1), Sothat also the first relation of (7.9) is veri- 


fied. 
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By deftinition, the variable step Size method is at least of order 不 一 1. Under the 
assumption jn+1 一 几 n (1 十 O(zn)) the defect in (7.6) is of the form 
jiD(2o jn) 二 +D(2 jn) 十 O(MAT2) 


for all sufficiently smooth y( 芒 . Since the constant step Size method is of order K， 
the expression 刀 (Pn ,jn)is of size O(jn), So that we observe convergence of 
Order K. 














The Symmetry relation (7.9) has the following interpretation: if the approxi- 


mations yn,...,VnAk LUsed with step Sizes PP ,HPn+k1yield yz then the 
values yp ,Vn+lapplied with PPn yield yn as aresult (Since the 
coefficients ay and CO only depend on step Size ratios and the mnultistep formula 
only on 1 the Same result is obtained with 一 岂 1 ,一 pn). This is the 


analogue of the definition of symmetry for one-step methods. 

For obtaining a good long-time behaviour, the Step Sizes also have to be chosen 
in aSymmetric and reversible way (See Sect. VIU.3). One possibility is to take Step 
SiZeS 


hotel 引 (coneD+aGnrn)， (7.10) 


where s > 0, and oly) is a given positive monitor function. This condition is an 
implicit equation for Rn+kz -li, because yn depends on 媳 n+-1.Ithastobe solved 
iteratively. Notice, however, that for an explicit multistep formula no further force 
evaluations are neceSSary during this iteration. Such a choice of the Step Size guar- 


antees that whenever ATk-1 is chosen when stepping from yn ,yn+k1 with 
jn ,An ato yythestep Size An is chosen when stepping backwards from 
Vn 二 Pi Vn+1 With PE jnHlto on 


JImplementation. For given initial values yo, yo, the starting approximations V1， 
..,W_1 Should be computed accurately (for example,， by a high-order Runge 一 
Kutta method) with step Sizes Satisfying (7.10). The solution of the scalar nonlin- 
ear equation (7.10) has to be done carefully in order to reduce the overhead of the 
method. In our code we use jnTR-1 :一 凡 2 上 2 /hn -aaspredictor and we apply 
modifed Newton iterations with the derivative approximated by finite differences. 
The coeffhicients w 了 (jn .hn 1i) have to be computed anew in every itera- 
tion. We use the basis 


“一 工 


IJ 一 0 


for the polynomials of degree 和 Kin (7.60). This leads to a linear triangular System 
for ao,...,Qk. As noticed by Cano 上 Durian (2003b), the coefficients pi( 好 ) and 
永 ( 帮 ) can be obtained efficiently from the recurrence Telations 


Do( 夫 三 1， Vi 的 一 企 一 友 )PDi( 芍 
Do 三 0， Di = 过 一 藤 ) 广 ( 鸭 十 衣 
六 王 0， 旋 榴 王仁 一 攻 ) 永 (十 2Di(， 
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Fig. 7.3. Maximum error in the total energy during the integration of 2500 orbits of the Kepler 
problem as a function of the number of steps per period 


During the iterations for the Solution of the nonlinear equation (7.10) only the values 
of pi( 妃 +) have to be updated. 


Numerical 了 xperiment，YWe repeat the experiment of Fig.7.1 with the method 
SY8, put this time in the variable step size version and with ol(y) = yl as step 
SiZe Inonltor. We have computed 2300 periods of the Kepler problem with eccen- 
tricity e 一 0.2, and we have plotted in Fig.7.3 the maximal error in the Hamiltonian 
as a function of the number of steps per period (for a _ comparison we have also 
included the result of the fxed step Size implementation). Similar to (7.2) we Use 
approximations yn that are the derivative of the interpolation polynomial passing 
through yn, yn 二 1l, Vn 土 2, ... Such that the correct order ls obtained. The computa- 
tion is stopped when the error exceeds 10 一 2. 

As expected, the error is Smaller for the variable step Size verslion, and it js Seen 
that the peaks due to numerical resonances are now Imuch less although they are 
not completely removed. For large Step Sizes, the performance deteriorates, put this 
is not a Serious problem, because these methods are recommended only for high 
accuracy computations. 

It Should be remarked that the overhead, due to the computation of the coef6- 
cients oj and the solution of the nonlinear equation (7.10), is rather high. Therefore， 
the use of variable step Sizes is recommended only when force evaluations FV) are 
expensive Of When constant step Sizes are not appropriate. Cano 多 Durin (2003b) 
Teport an excellent performance of Symmetric, variable Step Size multistep methods 
for computations of the outer Solar System 














Despite the resonances and instabilities, then，Symmetric methods can 
still be abetter choice than Stormer methods for long integrations of plan- 
etary orbits provided that the user is aware of the dangers. 

(G.D. Quinlan 1999) 
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XV.S Multi-Value or General Linear Methods 


General linear methods is a class of integration methods that covers Runge-Kutta 
as well as multistep methods. It is therefore of interest to study which of the results 
on the long-time behaviour can be extended. 
So-called multi-value or general linear methods are deftined by 了 +1 = Gp(2)， 
where 
2 三 万 于 十 PPBHDT) 


UL = C 取 十 4 


with FTD = (7 7 for Da = (and 
球 王 (人 .内 ). Weuseasloppynotation in the sense thatthe matrices 刀 , 成, .. 
Should be replaced with 忆 Q@1DBQ@TI1... .Foracomputation, astarting procedure 
om and a finishing procedure z，which extracts the numerical approximation yn 
from Yn, have to be added (see Fig. 8.1). We assume the existence of a vector e Such 
that with 卫 一 (1 ,了 1) 





(8.1) 





De = e， Ce 一 1 (8.2) 


holds (preconsistency conditions). The vector yu is then an approximation to ey(t) 
(more Precisely to e @ V(tn)). 

For Runge-Kutta methods, 刀 = (1l) is the one-dimensional identity, 已 = 
(2 0),C = 和 and4istheusualRunge-Kutta matrix. For multistep methods， 
we have 到 = (on andDisthe 大 xx 大 matrix with characteristic 
polynomial p(C) as in (2.1). For a detailed treatment of general linear methods we 
refer the reader to Chap.4 of the monograph of Butcher (1987), and to Chap.IIU.8 
of Hairer Ngrsett & Wanner (1993). 


Gn Gn 
了 7 了 也 


急 和 @ 外 

20 21 2V2 

Fig. 8.1. Ilustration of a multi-value method 了 21 三 Gn( 殉 ) with starting procedure 9n 
and finishing procedure Z 


XYV.8.1 Underlying One-Step Method and Backward Error 
Analysis 


In analogy to multistep methods, a method (8.1) is called strictly stable, 过 all eigen- 
values of D are inside the unit circle with the exception of the Single eligenvalue 
46 三 1. An extension of Kirchgraber's result (Theorem 2.1) to strictly stable general 
linear methods ls given by Stoffer (1993). 
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Theorem 8.1，Co7sider astrictiy staDie ge1- 


e1al 1iear 11etbod 孜 +1 三 Gj( 卫 )， ad a xo Gy 1 
Joispizpg procedue on = 有 柬 ( 殉 ) = 哎 十 


@O(J. 4ssuze 1patf(8.2) and de=1jpo 以 





(2 7Tje1 1pere exist Q UVU1IGUe oO11e-StePD /97 /on 
1ae 太 od 环 (V) Ga12d a UNigue Startii8 P1oce- 7 
ae on) SUcp tar Gh ooch = 9hoGh 功 信 


Ga1d Zoo7 王 7TQ Ho1Q. 
( 刘 Tje aa Ah = {fSn( 人 ;VE Rd 5 iovariant zader Gh and 开 他 
EXDO1e11IGLLy GaCIIVe. 


Prooj Since the method is strictly stable, there exists a matrix 7 Such that 


1 0 


和 汗 有 
全 有 PNv 


】 with | Doll < 1， 


and Tel = e (where el = (1,0,...,0)7). The proof closely follows that of The- 
orem 2.1. With the transformation (上 En， 11m) 工 一 2 = 了 -1 ,the general linear 
method (8.1) becomes 


6n+ 到 Cn 
( 人 】 全 ( Do77n ) + 亲 BC) (8.3) 





with Un = CT 2 十 AL4FUn+T) .Asbefore, Theorem XII3.1 can be applied and 
yields the existence of an attractive manifold A. = {(6s( 避 ); ee 开 4, which is 
invariant under the mapping (8.3). We now invert the restriction of Z7，onto the 
manifold AM Due to dre = 1 and Tel = e,wehavefor2 = 2 = (4 s() 
that 

一 及 (人 TZ(9) =dTZ(9+…= 一 9(9， (8.4) 


where 9g(6C) is Lipschitz continuous with constant O( 户 ). By the Banach fxed-point 
theorem the equation (8.4) has a unique solution < = 7( 妇 . Putting 


Sn) =T2Z(r(J) = 】 


we have found the unique starting procedure satisfying Zoo9n = 7d and 
7T-19n(J)EA Wefinally define 5 = 下 oGhoShn andA 人 ={17T2; ENA 
So that all statements of the theorem are verified. 














It is our aim to extend the concept of an underlying one-step method to nearly 
all (including weakly stable) general linear methods. 


Theorem 8.2， Co7zsider a 8e1e1al 1inear 11ethod (8.1) da1d assu11e 1pat 5 = 三] 
ia Si118le eige1value of tpe Propagation IIa11rix 万 . Fourtpermaioye， er Gh(Y) a1ad 
末 (7) = 开 科 十 .jpave expansions 识 powers of 放 azmd assu1ie 1jpaf (8.2) az 
die=17j1old 7Tjen 1Pere exisf aunaigue 加 raal ome-sfep maetjpod 
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酌 ( 人 =2 十 Pd) 十 jda( 仿 十 
G1C a U11GUe Jo711aL stG111118 P1ocedu1e 
3( 切 一 二 十 PS1( 纺 十 大 Sa( 轨 十， 
SUCP IatjpPaaly Ghoo 三 oogh and 有 oo =7TQ Pold. 


Proof Expanding Sun(@n( 人 )) and Gn(Sn(y)) into powers of ] a comparison of 
the coeffcients yields 


edj( 人 十 (一 疡 )5j() 三 ， (8.3) 


where aright-hand side depends on known functions and on di(y), Si(y) withz < 了. 
Similarly, the condition 玉 (Sy(y)) = yleads to 


dSi(W) = (8.0) 


Dueto the fact thatC = 1is asingle eigenvalue of D, and that dx e 天 0, the system 
(8.5)-(8.6) uniquely determines dj(V) and 97(7)， 














Backward Error Analysis for Smooth Numerical Solutions. The formal analy- 
Sis of Chap.IX can be directly applied to the underlying one-step method of The- 
orem 8.2. This yields a modified differential equation, but only for the Smooth nu- 
merical Solution (cf. 9ect.XYV.3.1). Notice that this modified equation depends on 
the choice of the fnishing procedure 有 . 


XYV.8.2 Symplecticity and Symmetry 


Before giving a precise meaning to the Symplecticity and symmetry of general linear 
methods, we establish the following lemma. 


Lemma 8.3. Fopr a ge1e1al 1inear 1paetjpod 世 T1 = Gh( 隐 ) we comasider fo di1e 广 
e1tJi1is1i78 Pocedu1es 1 三 刺 ( 隐 ) aad 办 三 灰 ( 芭 ) : 











瑟 ) 1 五 2 瑟 P 
0 2V1 2 
已 | 访 | 访 | 访 
蕊 GP 六 Gn 蕊 GP 
sm nm | 
厂 ) 瑟 ) 五) 
0 2V1 2 


7T7jpe hpo correspo1dijzg o1e-step 1ietpodsy (V) aad 有 ( 切 (give py 77jeo- 
1e111 8.2) da1e 如 ea co1jzgaie 10 eacp oljper 1e.， 


arlogodh = yi 而 = 下 oo8 (8.7) 
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Prooj The equations involving the underlying one-step methods or the starting pro- 
cedures have to be understood in the sense of formal series. By Theorem 8.2 we have 
Si( 人 妇 =ey+O( andalso 包 (人 =ey 二 O( 站 .Itthusfolowsfrom 有 os = 7 
that an(y) is O( 几 )-close to the identity and therefore invertible. 














The transformation an in the phase Space is @O(P)-close to the identity. The 
Telation oil o 9 o oh 一 GT， Which is a consequence of (8.7), therefore implies 
that the numerical solutions of 2 and G Temain @( 尹 )-close for all times. This 
means that the long-time behaviour of both methods is exactly the Same. 

Consequently, for a given general linear method Cn it is sufficient to require 
Symplecticity or Symmetry for ome fnishing procedure only. 


Definition 8.4 (Symplecticity). A_ general linear method Cn is called symazPpliectic 
让 there exists a fnishing procedure zsSuch that the underlying one-step method 
2 of Theorem 8.2 is symplectic, ie., 2 (JE (V) = Jin the sense of formal 
SefieS. 


The study of symplecticity of linear multistep methods (Sect.XYV.4.1) was rather 
disappointing. We couldnotfind one linear multistep method whose underlying one- 
step method is Symplectic. For general linear methods, Some neceSsSary conditions 
for the Symplecticity of the underlying one-step method are known which are hard 
to Satisfy (Hairer 久 Leone 1998). For the moment，no Symplectic general linear 
method (not equivalent to a one-step method) is known, and we conjecture that Such 
a method does not exist, even in the class of partitioned general linear methods 
(treating the pz and d variables by different methods). 

Afterthe disappointing non-existence conjecture of symplectic multi-value meth- 
ods, we turn our attention to Symmetric methods. We know from the previous chap- 
ters that for reversible Hamiltonian Systems, the long time behaviour of symmetric 
one-step methods can be as good as that for Symplectic methods. There are Sev- 
eral definitions of symmetric general linear methods in the literature. However they 
are either tailored to very Special Situations (e.g., Hairer, NUrsett 久 Wanner 1993)， 
or they do not allow the proof of results that are expected to hold for symmetric 
methods. 


Definition 8.S (Symmetry). A general linear method Cm is called sy7z11zzetric 让 
there exilsts a finishing procedure zz Such that the underlying one-step method @7 
of Theorem 8.2 is symmetric, ie., @_hn(y) = 有 1(y) in the sense of formal series. 


了 xample 8.6， Consider the trapezoidal method in the role of G7 and the explicit 
Euler method with step Size 一 7yR as finishing procedure: 
用 
Gn : Yo+l 三 Ym 十 了 (人 下 j0G54) 
了 7 ， Zn 二 1 一 +1 二 7 有 FT1) 


The corresponding starting procedure and underlying one-step methods are then the 
implicit Euler method and the following 2-stage Runge-Kutta method: 
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7 : 对 三 名 十 ?AID) 
人 小 
Gh : Runge-Kutta method 1+7| 1/2+7 1/2 
1/2 十 yY 1/2 一 MY 


The method 2 is symmetric only for y = 0, for7y = 1/2, andfor7y = 一 1/2. 
This example demonstrates that the Symmetry of the underlying one-step method 
strongly depends on the finishing procedure. 

On the other hand, this example Shows that the 2-stage Runge-Kutta method 
is Symmetric in the Sense of Deftinition 8.5 for all 7y (because it is conjugate to the 
trapezoidal rule). It is not Symmetric according to the definition of Chap. V. 








A Useful Criterion for Symmetry. Definition 8.3 is rather impractical for verifying 
the Symmetry of a given general linear method. We give here algebralic conditions 
for the coefficients 4, 已 ,C, D of a general linear method (8.1), which are sufficient 
for the method to be Symmetric. We assume that the finishing procedure yn+l1 王 
柬 ( 癌 1 is given by 


gr 万 了 TAB 了 态 H=GCHT+A4HD， (08.8) 
in complete analogy to method (8.1). 


Lemma 8.7 (Adjoint Method). Zer 芭 +1 = Gp( 区) pe te general1inaearmetpod 
Sivel py 4, 成 ;CD IOiiiovertipie 有 1 = 柬 ( 交 1) pe Jisjing Proceduye 


Sive1l py 4; 忆 ,CD, and denrote by DJ 办 Ider1ying one-step 11ethod 7Tjpem 加 e 
ZU1Gerlyi128 oO11e-steDp 111etpod of 


CT7 : 4 一 CCD-IB--4 有 ==DTIB，C0r CD 用 =D-1 


下 4 和-4 有 -5 0=G Dr=D 万 
1 1je adjoint 11ethod 97 三 G-1 OF 2 人) 
Poof Substituting 中 全 一 六 and 节 +1 人 区 in(8.1) yields 
Di=CIH1 一 APA4FDn+TD)， 区 王 万 了 1 一 ABCDn+i) 
Extracting 站 1 from the Second relation and inserting it into the first gives 


Un+l 王 人 十 A(CD-I 和 4)FDn+1) 
芒 H1 三 丰 -你 十 ADTIBJFDnHi)， 


Which is exactly method C7. The same replacements in the finishing procedure 


Ts+1 < CTY， 1 j4707)， zym 一 万 芭 虽 7/ 巨 F(Z) 











and in the diagram of Theorem 8.2 prove the statement, 
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Theorem 8.8. 矿 1Pere exist da1a ivertipie 1atrixr QQ (Satishipg GOe =emwithegiven 
DY (8.2))a11d a Petatio1z 1IG1rixX 已 SUCcP tat 


P-I4P = CD-IB- 4 Q-LBP = D-1B， 


P-ICQ = CD-1 Q-1DQ = D-1 人 


1je1 1je 8e1le1QL Li11ear 1Ietpod (8.7) 18 8Sy11111e117C. 


Poojf We consider the change of variables 世 = 名 辫 ， 六 人 in the method 
(8.1). Since P is a permutation matrix，we have FLPUD) = 已 AD), so that the 
method becomes 


P 世 = CQ 护 TN4PHBDD， 9 = DO 六 +PBPHED 


The assumption (8.9) implies that this method is the Same as the adjoint method 
of Lemma 8.7. Taking a finishing procedure Zin Such a way that yn+1 一 
末 (Q 人 区， is identical to the finishing procedure 加 +1 一 六 (人 区 1) of the ad- 
joint method (ie., = 0 and 万 such that DQ = 记 ), we obtain @G; = @. This 
proves the Statement. 














The sufficient condition of Theorem 8.8 reduces to the known criteria for clas- 
Sical methods. Let us give Some exXamples: 


e。 For Runge-Kutta methods we have 万 = (已 = 好 arowvecton andC = 1 
With CQ = (1) and 己 the permutation matrix that inverts the elements of a vector， 
We get 

刀 = 好 ， P4P=1o 一 4， 


which is the Same (V.2.4). 
Multistep methods in their form as general linear methods (Sect.XV.8) Satisfy the 
condition of Theorem 8.8 让 


ai 一 一 Qi 永 = 所 -. (8.10) 


One can take for 尸 and Q the permutation matrices (inverting the elements of a 
vector) of dimension 上 十 1 and pi, respectively. 


XV.8.3 Growth Parameters 


For a Tigorous study of the long-time behaviour of general linear methods it is not 
sufficient to investigate Smooth numerical solutions. One has to get bounds on the 
parasltic solution components, which are present when one considers the general lin- 
ear method without any starting and finishing procedure. This ls certainly diftcult， 
as it ls for multistep methods (1.1). We restrict here our analygsis to the linearized 
parasitic modified equation. 

The eigenvalues of the matrix 刀 in (8.1) will play the role of the zeros of (6C) ip 
(1.1). We denote them by 61 = 1 and ,Gand weassume thatthey are Simple 


XV.9 Exercises 015$ 


and of modulus one. Motivated by the analysis for multistep methods we write the 
approximations Yas 


落 =Y(nzT >》 人 Ze(nj (8.11) 
LEZ* 


with smooth functions 郊 (为 and Qu. The index setZ* has the same meaning as in 
Sect.XV.3.2. We insert (8.11) into (8.1) and compare coefficients of 6 . This gives 
with 夺 王 m 必 





YLTThM = DYO+TABHFCYG) +O(A2) 攻 
Get 站 = DZ 鸭 +TPBJCYG)C2e0 TO(C). 
To get an amenable form of the modified equations we write the vectors 开 (四 2e (区 
in the basils of eligenvectors of 六 , which we denote by Ww1 一 e and 2，. .OK: 


天 开 
Y(Wb) 三 于 功 的 Wo 及 的 = >》 zj 的 ao 


J 王 1 


JInserted into (8.12) and expanded into a series of 凡 yields 


见 三 7 +OU， 


and algebraic telations of the form yj = O(P) for7y 过 2. Similarly，we get 
algebraic relations for zej 蚊 = O(P) 证 7 和 上 andthe function ze := 20e 人 
Satisfies 


和 二 Nej)zT+TON wiih 三 GyBCuoj (8.13) 


Where wj is the left eigenvector of 忆 corresponding to the eigenvalue Ge. This is in 
perfect analogy to the computations of Sect.XYV.3.1. 

This analysis can be extended straightforwardly to partitioned general linear 
methods, where different methods are applied to the components y and v of a parti- 
tioned differential equation. Unfortunately, we do not know of any results that would 
extend those of Sect.XYV.0 to general linear methods. 


和 V.9 上 Xercises 


1. Let G1(2z) be the Principal root of the characteristic equation p(0C) 一 za(0) = 0. 
Prove that for irreducible multistep methods the condition 61( 一 2)6G1(z) 三 1 Gin 
aneighbourhood of z = 0) is equivalent to the Symmetry of the method. 

2. (Lambert 儿 Watson 1976). Prove that stable, Symmetric linear multistep meth- 
ods (1.8) for second order differential equations, for which the polynomial p(6) 
has only simple zeros (with the exception of 5 = 1), has a non-vanishing inter- 
val of periodicity, ie., the roots Gi(z) of p(C) ->2c(C) = 0satisfy | 人 (GO)| =1 
for suffticiently small real y. 

Zz7t Simple roots cannot leave the unit circle under Small perturbations of y/. 


010 


10. 


11. 


XV. Dynamics of Multistep Methods 


. Consider a Symmetric, S-Sstable multistep method (1.8). Hit is irreducible (no 


common factors of p(C) and oa(G)), then Kis even. Hence p( 一 1) 夫 0. 


. Using Theorem XII.3.2, prove that the underlying one-step method of a strictly 


Stable rth order linear multistep method has order 7. 


. (Dahlquist 1959). Consider the linear problem y = Xy and apply a Symmetric 


linear multistep method (1.1) as in Example 2.2. Prove that for 上 = jand 
玉 一 0， 
名 


where HMj1 is the growth parameter. 


.Consider a general linear method (8.1). 开 there exist an invertible Symmetric 


matrix C and a diagonal matrix 4 such that 


UDPDIGD-G DGPB -CC74 加 
2 Ce = 
then the method is C-Symplectic. 

ZL Adaptthe proof of Burrage 多 Butcher for 已 -Stability (See Hairer 儿 Wan- 
ner (1990), page 3538). 


. ARunge-Kutta method can be conslidered as a general lnear method with 刀 三 


(1),C = 牙 Prove that the condition (9.1) is equivalent to the symplecticity 
condition of Chap. VI. 


. Extend the definition of C-Symplecticity to partitioned general linear methods， 


and prove that the condition 


开 C 方 开 C 户 了 
We DPIGB--C74 ) -。 0 


BTG 万 -46 BTGB_-474-44 


implies that the method is C-Ssymplectic. 


.Construct general linear methods of order 7 > 2, for which all growth parame- 


ters are positive. Find such methods,， which have a smaller degree of implicit- 
ness than Symmetric one-step methods of the Same order. 
Write a Maple program that checks the coefficients of Iable 7.1. After deftining 
rho:=p(2), use the instructions 

> Sigma := 七 ayYLIor (rho/ (1og(z)*log(z))，z=1y8) 

> factor (expandq (convett (sigma Polynom) ) ) ; 
Construct partitioned general linear methods Which are symmetric, explicit, of 
high order and for which the matrices 六 and 刀 have distinct eigenvalues (with 
the exception of 1). Compared to multistep methods, smaller dimensions of the 
matrices D and 万 are possible. 
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Simplifying assumptions 96 

Sinc function 473, 481 

Singular value decomposition 133 

SL(m), special linear group 119, 130 

5[(m), special linear Lie algebra 119 

small denominators “ 400 

SO(m), special orthogonal group ”119 

50(n), Skew-symmetric matrices 119 

Spherical pendulum 238, 254 

Splitting 

- fast-slow 317 

-Lie-Trotter 47 

-- Marchuk 47 

- of ordered tree 370 

-Strang 47,230 

splitting methods 47,48, 91, 193, 252， 

270, 284, 298, 348 

D-compatibility “ 145 

- negative Steps 82 

- of higher order 82 

- order conditions 80 

Sp(m), Symplectic group 119 

5hb(m), symplectic Lie algebra 119 

Stability 

-linear 23 

- long-term 392 

Stability function 194 

starting approximations ”326 

-order 327 

Step Slze control 

-integrating, reversible 314, 397, 449， 
538 


- proportional, reversible 310,313, 356， 
449 

- Standard 303 

- structure-preserving 310 

Step Size function 308,311 

Stiefel manifold 131 

Stormer-Verlet Scheme “7, 9, 39, 48, 189， 
270, 318, 349, 386, 472, 586 

-as classical lmit 300 

- as composition method 148 

-as Nystrom method 41 

-as processing method ”159 

- as splitting method 48 

-as variational integrator 208 

一 energy conservation 368,513 

- linear error growth 414 

- Symmetry 42, 145 

- Symplecticity 48, 190 

-variable step Size 308, 309,312, 313， 
315 

Strang Splitting 47,230,315, 348 

Structure constants ”280 

Submanifold 109 

- Symplectic 259 


Subtrees 

-ordered “60 

Summation 

- compensated 323 
SUperconvergence ”32, 37,250 
Suzuki's fractals ”45,46, 153 


Switching lemma 70 


Symmetric collocation methods ”146, 176 
Symmetric composition ”94 

-of first order methods ”150 

- of symmetric methods 150, 154 


Symmetric composition methods 149 

- oforder6 156 

-oforder8 157 

- oforder 10 158 

Symmetric Lie group methods 169 

Symmetric methods ”3, 42, 143, 144, 342， 
612 

-explicit 148 

- Symmetric composition “154 

Symmetric methods on manifolds 161 

Symmetric projection 161 

一 existence of numerical solution ” 162 

-non-reversible 166 

Symmetric Runge-Kutta methods “1406， 
176 

Symmetric splitting method 177 

Symmetrized order conditions 177 

Symmetry 289,311,0613 


Index 043 


-- of Gauss methods 147 
-- ofLobatto 147 
- of symmetric local coordinates ”168 


Symmetry coefficient 37,67,72 
Symplectic 183, 196, 241 

-- B-series 217 

- maps 208 


-- P-series 217 

- projection 259 

- Submanifold 25S8, 295 

Symplectic Euler method 4, 48, 189, 193， 
230, 242, 270, 340, 346, 349, 383 

- as Splitting method 48 

一 energy Conservation 308 

- variable step Size 307 

Symplectic methods ”187,012 

- as variational integrators 207 

- based on generating functions ”203 

- irreducible 222 

- Nystrom methods ”194 

- partitioned Runge-Kutta methods ”193， 
208 

-- Runge-Kutta methods ”191 

- variable step Size 306 

Symplectic Submanifold 259 

Symplecticity 244, 385 


Takens chaos 563 

tangent bundle 239 

tangent Space 114, 120 

- parametrization 117, 134 

0-method 147 

-- adjoint 148 

three-body problem 321, 390 

time transformation 306, 3S6 

time-reversible methods ”144 

Toda fow 109 

Toda lattice ”402,414, 440, 449 

total differential 186, 196 

total energy ”9, 18, 21, 98, 479, 484, 510， 
524, 600 

transformations 

- adiabatic 5S31, 32 

- averaging 458 

一 canonical 186 

-ITeverSsibility preserving 438 

- Symplectic 182, 183, 196, 241 

trapezoldal rule 28, 194, 223, 312 

trees 51,217, 369 

一 bi-coloured ”60 

-equivalence class 384 

-ordered 60 

一 oriented free 388 


044 Index 


- Iooted 53 

cor-trees 72 

trigonometric methods 473 
triple jump 44, 46, 153 
true anomaly 9 

two-body problem ”9, 23 
two-force methods 478 


underlying one-step method 573, 609 


Van der Pol's equation ”455 
variational integrators ”204 
variational problem 205, 237 
variational splitting 271 


vector fields 2 

- divergence-free 227 

一 Teversible 143, 144 

Verlet method 7, 39, 48, 189, 270, 318， 
472, 513 

- adaptive 309 

Verlet-Imethod 318, 475 

volume preservation ”105$, 113, 227, 231 

volume-preserving integrators ”228 


weak invariants “109 

work-precision diagrams ， 150, 193, 150， 
157, 334, 336, 482, 604, 605, 608 

人 帮 -transformation 235 


